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ALTERED LIN OLENIC AND LINOLEIC ACID 

CONTENTIN PLANTS 
This is a continuation-in-part of U.S. Serial No. 08/156,551 
filed November 22, 1993, which is a continuation of U.S. Serial No. 
5 08/014,431, filed on Februaiy 5, 1993. The present invention relates to 
genetically engineered plants. In particular it relates to genetically 
engineered plants and seeds which have altered linolenic and linoleic acid 
content compared with naturally occurring plants. 
BACKGROUND 

10 Many crop species produce seed oils in which the fatty add 

composition is not ideally suited to the intended use. The application of 
conventional breeding methods, coupled in some cases with mutagenesis, 
has resulted in the production of new varieties of several species with 
desirable alterations in the fatty acid composition of seed oil: A notable 

15 example is the development of low erucic acid varieties of rapeseed 
(Stefansson 1983). Similar efforts have resulted in the reduction of the 
level of polyunsaturated 18-carbon fatty acids in soybean (Wilcox and 
Gavins 1985; Graef et al. 1988), sunflower (Fick 1989), and linseed oils 
(Green and Marshal 1984). 

20 Most of the genetic variation in seed lipid fatty acid 

composition appears to involve the presence of an allele of a gene that 
disrupts normal fatty acid metabolism and leads to an accumulation of 
intermediate fatty acid products in the seed storage lipids (Downey 1987). 
However, it seems likely that, because of the inherent limitations of this 

25 approach, many other desirable changes in seed oil fatty acid composition 
may require the directed application of genetic engineering methods. 

a-Linoienic acid (18:3 ^-^^JS) an eighteen carbon fatty acid 
containing three cis double bonds at the 9-10, 12-13 and 15-16 carbons. It 
is foimd in the cells of higher plants as a constituent of cell membranes. It 
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is also found in storage organs, such as in seeds. There it is designated oil 
bodies which are bounded by an electron dense structure that is thought to 
be a half-unit membrane and dispersed in the cytoplasmic environment of 
cells. When present as a constituent of cell membranes, hnolenic add is 
5 usually esterified to the sn-1 or sn-2 position of the glycerol moiety of a 
diacyl-glyceroUpid. By contrast, when present in oil bodies, linolenic add is 
usually esterified to the sn-1, sn-2 or sn-3 position of a triacylglycerolipid 
(TAG). 

Linolenic add is extensively used in the paint and varnish 
10 industry in view of its rapid oxidation. Flax seed is a predominant source of 
this oil. Soybean seed, on the other hand, does not have suiBcient linolenic 
add content to be used in this industry. Thus, increasing the linolenic add 
content in a plant such as soybean would permit the use of the soybean oil 
in the paint and varnish industiy. 

On the other hand, it is xmdesirable to have significant levels 
of linolenic add in cooking oils and foods. Linolenic add is unstable during 
cooking and is rapidly oxidized. The oxidized products impart randdily to 
the finished product. A rapeseed or soybean oil with reduced linolenic add, 
such as containing 2% or less of linolenic add, would be ideal for use as a 
20 cooking oil. 

Linolenic add is also a precursor in the biosynthesis of 
jasmonic acid, an important plant growth regulator. Linolenic add is 
converted to jasmonic add by introduction of an oxygen to the carbon chain 
by a lipoxygenase, followed by dehydration, reduction, and several (J- 
25 oxidations (Vick and Zimmerman, 1984). The activity of jasmonic add has 
been measured in terms of induction of pathogen defense responses. By 
application of free linolenic acid to plants, plant pathogen defenses can also 
be induced (Farmer and Ryan, 1992). 
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A model has been proposed to explain the ability of free 
linolenic acid to exhibit the effects associated with jasmonic add (Farmer 
and Ryan, 1992). It is hypothesized that all of the enzymatic activities 
which are reqidred for the conversion of Hnolenic add to jasmonic add are 
5 constitutively present in the cell and the rate limiting step in the production 
of jasmonic add is the availability of free linolenic add. A likely route for 
the production of the free linolenic add is by the activity of a lipase in the 
plasma membrane. 

It has been observed that exogenous jasmonic add can more 

10 powerfully activate defense responses than can woimding. This suggests . 
that woimds cannot generate enough free linolenic add to support high level 
production of jasmonic add. The activity of the lipase or the availability of 
appropriate substrate for the lipase may be rate limiting upon wounding. 
Thus» increasing the linolenic acid content of plaisma membrane may 

15 positively influence "'signal transduction'' in plants and residt in better 
protection against environment and pathogen stress. 

Linolenic add, as well as oleic and linoleic acids are also 
important constituents, as well as precursors of volatile carbonyl 
compounds, whic contribute to the aroma of both fresh and cooked foods. 

20 The major fatty adds of tomato fruit pericarp are oldc, linoleic and linolenic 
acids. As the fruit ripens, the levels of the latter two fatty adds decline 
resulting in the production of a number of 4-6 carbon containing aldehydees 
and ketones. One particular metabohte, cis-3-hexanol, has been shown to 
be present in higher levels in vine-ripened tomatoes compared to 

25 supermarket tomatoes or tomatoes stored in refrigerators. It is likely, 
therefore, that the ^'aroma" of fresh fruits and vegetables can be 
"'modulated'' by regulation of the content of linolenic and linoleic acids, 
important substrates for the enz3rme Upoxygenase and subsequently the 
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hydroperoxide cleaving enzyme, which generates the volatile "aroma'' 
compounds. 

From the above, it is clear that the ability to vary the content 
of linolenic acid in plants would be desirable. However, to achieve this 
5 result it is necessary to determine what controls the product of linolenic 
add in plants. 

A large body of experimental evidence derived from 
radiochemical tracer studies has indicated that a-linolenic acid is 
63aithesized by the desaturation of hnoleic acid (18:2^9»12) (reviewed in 
10 Harwood 1988;). However, the actual substrate for desaturation is not 
known. 

In vivo and in vitro labelling studies suggest that there are 
possibly two distinct pathways for the ssmthesis of linolenic acid (Browse 
and Somerville, 1991). One possible pathway is thought to be located in the 

15 endoplasmic reticulum where linoleic add esterified to the sn-2 position of 
phosphatidylcholine is a substrate for desaturation. However, the 
available evidence does not exclude the possibility that linoleic acid 
esterified to other lipids may also be a substrate. 

A second possible pathway of linoleic add desaturation is 

20 located in the plastid where the available evidence suggests that linoleic 
add esterified to monogalactosyldiacylglycerol and, possibly, other plastid 
lipids is the substrate for desaturation. 

Relatively little direct information is available concerning the 
enz3nnes involved in linoleic acid desaturation. Low levels of enzyme 

25 activity have been detected in microsomal membrane preparations fi'om 
developing hnseed (Lintim ussitatxun) (Browse and Slack, 1981) and, more 
recently, in preparations of gently lysed chloroplasts (Schmidt and Heinz, 
1990a,b). The general features of the enzyme may be inferred fi^om 
information available about other enzjmies of this class. 
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The most thoroxighly characterized desattirase is the stearoyl- 
Coenz3nne A (CoA) desattirase from vertebrate liver (reviewed by 
Holloway, 1983). This eii23rme has been shown to be an integral membrane 
protein which contains non-heme iron. The desatm^ase reaction reqviires 
5 fatty acyl-CoA, molecular oxygen and reduced cytochrome b5, another 
membrane protein. In vivo , the reduced cytochrome b5 is produced by the 
transfer of reducing equivalents from NADH via the activity of 
cytochrome b5 reductase, a flavin containing membrane protein. 

The most thoroughly characterized desaturase from plants is 
10 the stearoyl-ACP desaturase (McKeon and Stumpf, 1982; Shanklin and 
Somerville, 1991). This enzyme also requires molecular oxygen and a high 
potential reductant. However, in contrast to the animal enzyme, this 
" desaturase is a soluble plastid protein which preferentially acts on a fatty 
add esterified to acyl carrier protein (ACP) rather than CoA. This enzyme^' 
15 also differs from the animal enzyme by utilizing reduced ferredoxin as £ui 
intermediate electron donor. 

Other plant desaturases appear to be membrane proteins. 
The microsomal A12 oleate desaturase from several plant species has been 
assayed in membrane preparations from several plants (Harwood, 1988). 
20 As with the stearoyl-CoA desaturase from animals, this enzyme requires 
molecular o^gen and reducred cytochrome b5 as an electron donor (Keams 
et al., 1991). However, it appears that oleate esterified to a phosphoUpid is 
the substrate rather than a CoA ester. 

With regard to the activity responsible for the making of 
25 hnolenic add, httle was known as to its sotirce or origin. However, evidence 
that the amount of linolenic acid is related to the amount of hnoleic acid 
desaturase activity has been obtained by analysis of the properties of the 
fads mutant of Arabidopsis thaliana (Lemieux et al. 1990). This mutant is 
deficient in linolenic add in the storage oils of its seed lipids and in the 
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membrane lipids of diflferent tissues to varying degrees. The mutant also 
had an increase in the amount of linoleic acid. This can be interpreted as 
evidence that the mutant is defective in the activity of a desaturase which 
converts linoleic add to hnolenic add. 
5 There is further evidence to suggest that the activity of this 

desaturase could be rate limiting for linolenic add synthesis under normal 
circumstances. This was discovered by measining the efifects on fatty add 
composition in heterozygous plants (i.e., fad3+/fad-) formed by crossing the 
wild type with the fad3 mutant. In these Fl plants, which have one copy of 
10 the normal fad3 gene product instead of the two normally foxmd in the wild 
type, the amount of linolenic add was almost exactly intermediate between 
that found in either parent This suggests that the amount of linolenic add 
is proportional to the amount of functional fadS gene product (Lemieux et 
al., 1990). 

15 These results do not shed any light, however, on the nature of 

the fads gene product or whether the observed effects in mutants are 
related to either a decrease in quantitiy of desaturase protein or desaturase 
activity due to a defective protein. 

Moreover, nothing is known with any degree of certainty 
20 about the linoleic add desaturase from plant microsomes. As noted above, 
very little is known about the microsomal desaturases except that they 
probably utilize reduced cytochrome b5 as intermediate electron donor and 
probably utilize hpids rather than CoA or ACP esters as substrates. 

Moreover, as in many other aspects of plant biology, the lack 
25 of specific information about the biochemistry and regulation of lipid 
metaboHsm makes it difficult to predict how the introduction of one or a few 
genes might usefully alter seed lipid synthesis. 

An additional problem arises from the fact that many of the 
key enzymes of lipid metabolism are membrane-boimd and present in low 
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quantities. Thiis, attempts to solubilize and purify them from plant 
sources have not been successful. 
SUMMARY OF THE INVENTION 

The present invention provides structural coding sequences 
5 encoding linoleic acid desaturase activity which can be used to alter the 
linoleic and linolenic add compositions of plants or to isolate other plant 
linoleic acid desaturases. The present invention further provides a plant 
capable of expressing a structiu-al coding sequence to control the level of 
linolenic add or Hnoleic acid or both in the plant. The present invention 

10 further provides a method for controlling the levels of linoleic and linolenic 
add in plants. It is also demonstrated by the present invention that the 
linoleic acid desaturase enzyme activity in plant cells and tissues is a 
controlling step in linolenic add biosynthesis. 

The present invention further relates to the engineering of two 

15 advantageous traits into plants: increased and decreased a*linolenic add 
content in the structural lipids or storage oils of various crop plants. 

In accomplishing the foregoing, there is provided, in 
accordance with one aspect of the present invention, a genetically 
transformed plant which has an elevated linolenic acid content comprising 

20 a recombinant, double-stranded DNA molecule comprising 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably Unked to; 

(ii) a structural coding sequence that causes the 
25 production of an RNA sequence that encodes a linoleic 

acid desaturase activity; and 

(iii) a 3* non-translated region that functions in plant 
cells to promote polyadenylation to the 3' end of said RNA 
sequence. 
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In accordance with another aspect of the present invention, 
there is provided a genetically transformed plant which has a reduced 
linolenic acid content, comprising a recombinant, double-stranded DNA 
molecule comprising 
5 (i) a promoter that functions in plant cells to cause 

the production of an RNA sequence, said promoter 
operably linked to; 

(ii) a DNA sequence that causes the production of an 
RNA sequence that is in antisense orientation to at least 

10 a portion of a gene that encodes a linoleic add desaturase 

activity in said plant; and 

(iii) a 3' non-translated region that functions in plant 

cells to promote polyadenylation to the 3' end of said RNA 
sequence. 

15 There has also been provided, in accordance with another aspect 

of the present invention a method of producing a genetically transformed 
plant which has an elevated or reduced linolenic add content. There has 
also been provided, in accordance with another aspect of the present 
invention a recombinant, double-stranded DNA molecule and plant cells 

20 containing a recombinant, double-stranded DNA molecule. 
BRIEF DESCRIPTTON OF THE DRAWTNflR 

Figure 1 shows the genetic map of the region of chromosome 2 of 
Arabidopsis thaliana where a linoleic acid desaturase gene is located and 
the identity of the yeast artificial chromosomes which carry this region of 

25 the genome. 

Figure 2 shows the structure of plasmid pBNDES3 which was 
obtained by inserting an EcoRI fragment containing the JB. napus linoleic 
acid desaturase cDNA (fadS) into pBLUESCRIPT. 
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Figure 3 shows the nucleotide sequence (SEQ ID NO:l) and 
deduced amino acid sequence (SEQ ID NO:2) for the linoleic add desaturase 
cDNA (fadS) from B. napus. 

Figure 4 shows a comparison of the deduced amino add sequence 
5 of one linoleic add desaturase cDNA (fadS) from B. napus and the desA 
gene from Synechocystis. Identical residues are indicated hy a solid box. 
Conservative substitutions are indicated by a stippled box. 

Figure 5 shows the binary Ti plasmid vector pBI121. 

Figure 6 shows the binary Ti plasmid pTiDESS which was 
10 constructed by insertion of a linoleic add desaturase cDNA (fadS) into 
pBI121. - 

/ Figure 7 shows the map of the plant transformation vector 
pMONl3804. 

Figure 8 .shows the map of the. plant transformation vector 
15 pMON13805. 

Figure 9 shows the oil content of control and transformed canola 
seed in accordance with the present invention. 

Figure 10 shows the nucleotide sequence (SEQ ED NO:9) for the 
Unoleic add desaturase cDNA (fadD) from Arabidopsis. 
20 Figure 11 shows the deduced amino acid sequence (SEQ ID 

NO:10) for the linoleic acid desaturase cDNA (fadD) frora Arabidopsis. 

Figure 12 shows the nucleotide sequence (SEQ ID NO:ll) for the 
Unoleic add desatiu^ase cDNA (fadE) from Arabidopsis. 

Figure 13 shows the deduced amino add sequence (SEQ ID 
25 NO:12) for the linoleic add desaturase cDNA (fadE) from Arabidopsis. 
DETAILED DESCRIPTION OF THE INVENTION 

A genetically transformed plant of the present invention which 
has an altered linolenic or linoleic acid content can be obtained by 
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expressing the double-stranded DNA molecules described in this 
application. 

The expression of a double-stranded DNA involves transcription 
of messenger RNA (mRNA) from one strand of the DNA by RNA 
5 polymerase enzyme, and the subsequent processing of the mKNA primary 
transcript inside the nucleus. This processing involves a 3' non-translated 
region which adds polyadenylate nucleotides to the 3' end of the RNA. 
Promoters 

Transcription of DNA into mRNA is regulated by a region of 
10 DNA usually referred to as the "promoter." The promoter region contains a 
sequence of bases that signals RNA polymerase to associate with the 
DNA, and to initiate the transcription of mRNA using one of the DNA 
strands as a template to make a corresponding complementary strand of 
RNA. 

15 Any promoter which is known or is found to cause transcription 

of RNA in plant cells can be used in the present invention. Promoters 
which are useful in the present invention include any promoter that 
functions in a plant cell to cause the production of a RNA sequence. A 
nimiber of promoters which are active in plant cells and are capable of 

20 producing a RNA sequence have been described in the hterature. These 
include the nopaline synthase (NOS) and octopine synthase (OCS) 
promoters (which are carried on ttmior-indudng plasmids of Agrobacterium 
tumefaciens), the caulimovirus promoters such as the cauliflower mosaic 
virus (CaMV) 19S and 35S and the figwort mosaic virus 35S-promoters, 

25 the light-inducible promoter from the small subimit of ribulose-l,5-bis- 
phosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide), 
and the chlorophyll a/b binding protein gene promoter, etc. All of these 
promoters have been used to create varioiis types of DNA constructs 
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which have been expressed in plants; see, e.g., PCT publication WO 
84/02913 (Rogers et al., Monsanto). 

Promoters may be obtained from a variety of soiirces such as 
plants and plant viruses. Promoters can be used in the form that they 
5 exist as isolated from plant genes such as ssRUBISCO genes, or can be 
modified to improve their effectiveness, such as with the enhanced 
CaMVSSS promoter. 

Those skilled in the art will recognize that the amount of linoleic 
acid desaturase needed to induce the desired alteration in linolenic acid 
10 content may vaiy with the type of plant. It is also possible that extremes 
in linoleic add desaturase activity may be deleterioxis to the plant.- 
Therefore, in a preferred embodiment, promoter function should be 
optimized by selecting a promoter with the desired tissue expression ' 
capabilities and approximate promoter strength and selecting a 
15 transfprmant which produces the desired Unoleic add desaturase adivity.ih 
the target tissues. . - 

This selection approach from the pool of transformants is 
routinely employed in expression of heterologous structural genes in plants 
since there is variation between transformants containing the same 
20 heterologous gene due to the site of gene insertion within the plant genome. 
(Commonly referred to as "position effect"). 

In a preferred embodiment, the promoters utiHzed in the double- 
stranded DNA molecules should have relatively high expression in tissues 
where the increased or decreased linolenic acid content is desired, such as 
25 the seeds of the plant. In Canola, a particularly preferred promoter in this 
regard is the seed specific promoter described herein in greater detail in the 
accompanjdng examples. 

In another preferred embodiment, the promoter used in the 
expression of the double*stranded DNA molecules of the present invention . 
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can be a constitutive promoter, expressing the DNA molecule in all or most 
of the tissues of the plant. However, the promoter selected for this 
embodiments should not cause expression at levels which are detrimental 
to plant health, growth and development. 
5 fl-conglycinin (also known as the 7S protein) is one of the major 

storage proteins in soybean (Glycine max) (Meinke et al., 1981). The 7S (p- 
conglydn) a-subimit promoter, used in one aspect of this study to express 
the linoleic add desaturase gene, has been shown to be both highly active 
and seed-specific (Doyle et al, 1986 and Beachy et al., 1985). The B-subxmit 
10 of B-conglydnin has been expressed, using its endogenous promoter, in the 
seeds of transgenic petunia and tobacco, showing that the promoter 
functions in a seed-specific manner in other plants (Bray et al., 1987). The 
promoter for B-conglycinin could be used to in accordance with the present 
invention. If used, this promoter could express the DNA molecule 
15 specifically in seeds, which could lead to an alteration in the linolenic add 
content of the seeds.. 

In addition, the endogenous plant linoleic acid desaturase 
promoters can be used in the present invention. These promoters should be 
useful in expressing a linoleic acid desaturase gene in specific tissues, such 
20 as leaves, seeds or fruits. A number of other promoters with seed-specific 
or seed-enhanced expression are known and are likely to be expressed in 
seeds, which are oil acomiulating cells. For illustration, the napin promoter 
and the acyl carrier protein promoters have been utilized in the 
modification of seed oil by antisense expression (Knutson et al., 1992). 
25 The linolenic acid content of root tissue can be increased by 

expressing a linoleic acid desaturase gene behind a promoter which is 
expressed in roots. The promoter from the acid chitinase gene (Samac et 
al., 1990) is known to function in root tissue and could be used to express 
the linoleic acid desaturase in root tissue. Expression in root tissue could 
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also be accomplished by utilizing the root specific subdomains of the 
CaMV35S promoter that have been identified. (Benfey et aL, 1989): The 
linolenic acid content of leaf tissue can be increased by expressing the 
linoleic acid desaturase gene using a leaf active promoter such as 
5 ssRUBISCO promoter or chlorophyll a/b binding protein gene promoter. . 

The linolenic acid content of fruits can be increased by 
expressing a linolenic acid desaturase gene behind a promoter which is 
functional in fruits. Such promoters could be either expressed at all 
developmental stages of the fruit or restricted to specific stages, 

10 particularly finit ripening. 

The RNA produced by a DNA construct of the present invention- - 
can also contain a 6' non-translated leader sequence..: This sequence can be 
derived from the promoter selected to express the gene, and can be 

' specifically modified so as to increase translation , of the mRNA. The 5' - 

15 non-translated regions can. also be obtained from viral RNAs, from suitable 
eukaryotic genes, or from a synthetic gene sequence. The present 
invention is not limited to constructs, as presented in the following 
examples, wherein the non-translated region is derived from the 5' non- 
translated sequence that accompanies the promoter sequence. Rather, the 

20 non-translated leader sequence can be derived from an unrelated promoter 
or coding sequence as discussed above. 
linoleic Acid Desatu rase Structural Coding Sequences 

The structural coding sequence that causes the production of an 
RNA sequence that encodes a linoleic acid desaturase activity can be the 

25 sequences disclosed in the present application, or any sequence that can be 
obtained using the sequences disclosed in the present application, or any 
sequence that can be isolated using the method disclosed in the present 
appUcation. 
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The structural coding sequence can also be a part of or from the 
structural coding sequences disclosed in the present invention. It is possible 
that the active part of the linoleic add desatiu*ase is formed using only part 
of the structural coding sequences disclosed in the present appUcation. 
5 The structural coding sequences can be obtained from a variety 

of sources, such as algae, bacteria or plants. Preferably, structural coding 
sequences obtained from plants are used in accordance with the present 
invention. 

Since virtually nothing was known about the properties of the 
10 linoleic acid desaturase structural coding sequence prior to the present 
invention, the method used in the present invention to isolate the structural 
coding sequence was based on the concept of map based cloning. The 
essential concept in map based cloning is to use information about the 
genetic map position of a structural coding sequence to isolate the region of 

15 the chromosome surrounding the structural coding sequence, and then to 
use the isolated DNA to complement a mutation in the structural coding 
sequence. This strategy has never previously been reported in the isolation 
of any plant gene. 

In order to implement map based cloning of the linoleic acid 

20 desaturase, mutants of Arabidopsis thaliana (L.) deficient in linoleic acid 
desaturase activity were isolated by screening randomly chosen individuals 
from mutagenized populations of plants for individual plants with altered 
leaf or seed fatty acid composition. (Browse et al. 1985; Lemieux et al. 
1990). By screening thousands of plants for altered fatly acid composition, 

25 mutants with decreased amoimts of linolenic add and increased amounts of 
linoleic add in leaf and seed lipids were isolated. Physiological and genetic 
analyses of these mutants indicated that they fell into three 
complementation groups designated fadS, fadD and fadE. 
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The fads mutants had very reduced levels of linolenic add in 
seeds and roots but had almost normal levels of linolenic acid in leaves. 
This effect was interpreted as evidence that the fadS locus encoded a 
microsomal desaturase which was responsible for desaturation of linoleic 
5 acid to linolenic add on lipids made by the pathway of lipid biosjoithesis in 
the endoplasmic reticulum, designated the ''eukaiyotic pathway" (Lemieux 
et al. 1990). This pathway is mostly responsible for the sjmthesis of lipids 
in non-green tissues such as seeds and roots, but plays a secondary role in 
leaves and other green tissues. Thus, a mutation in the fadS gene would not 

10 be expected to have a major effect on the desaturation of leaf lipids. 

In contrast to the-fadS mutant, -the fadD mutant had almost 
normal fatty acid composition^ of roots and seeds, but -had a strong 
reduction in the amount of linolenic add in leaf lipids, and a corresponding 
increase in the anioimt of linoleic add. (Browse et £d., 1986). Thiis, this 
. 15; mutantyhad the properties expected of a mutant defident m a linoleic add 
desaturase from the prokaxyotic pathway which is primarily-'resp'onsible 
for the S3mthesis of lipids in green tissues. 

An imusual property of the fadD mutants was that they were 
very deficient in linoleic add content when grown at temperatures above 

20 about 22 'C but had almost normal fatty add composition when grown at 
temperatures below about 18 *C (McCourt et al., 1987). Since it was very 
unhkely that several independently isolated mutations would all give rise to 
a temperature conditional phenotype, it was concluded that a second 
desaturase must be partially responsible for desaturating linoleic acid to 

25 linolenic acid in green tissues. Therefore, the fadD mutant was 
remutagenized with ethylmethane sulfonate, self-fertilized to produce a 
segregating population of mutagenized plants (designated the M2 
generation), and this population was screened for a mutant which was 
deficient in linolenic acid in green tissues at low temperatiu^es. A mutant 
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with this property was isolated and the mutation responsible for this eflfect 
was designated the fadE locus (Somerville and Browse, unpublished). 
Isolation of the Linoleip Add Desaturase Genft from naT^^ ]^ 

The following example was used to isolate the structural coding 
5 sequence from the fad3 region. The method described herein could equally 
have been used to isolate either the fadD or fadE region. 

In order to approximately locate the fad3 mutation of the genetic 
map of Arabidopsis, a sexual cross was made between the fad3 mutant line 
BLl and the multiply marked mutant line Wl (Hugly et al., 1991). The Fl 
10 hybrids from this cross were permitted to self-fertilize and the resulting P2 
plants were scored for both the segregating genetic markers and the altered 
fetty add composition. The results of this aiialysis indicated that the £ad3 
mutation was located on chromosome 2 near the marker erecta. In order 
to obtain a more accurate map position by RFLP mapping, a second sexual 
15 cross was made between the fad3 mutant line BLl and the Niederzenz 
race of Arabidopsis. The Fl progeny were permitted to self-fertilize to 
produce the F2 generation. 137 F2 plants were grown diiring 3 weeks at 22* 
C (100 nE/m2/s) in order to produce fully expanded rosettes, and a few 
leaves (representing a total weight of 0.2-0.5 g per plant) were harvested 
20 from each plant in order to prepare DNA fi«m them. 

The leaves were frozen in liquid nitrogen, and ground in dry ice, 
using a mortar and a pestle. For each sample, the frozen powder was 
transferred to a microfuge tube and an equal amount of 2 X CTAB hvtffer 
(2% cetyltrimethyl ammonium bromide (CTAB), 100 mM Tris-HCl pH 8, 
25 20 mM EDTA, 1.4 M NaCl, 1% polyvinylpolypyrroUdone (PVP) 40,000) was 
added. The tubes were left at room temperature for 5 min to allow the 
powder to thaw. The homogenate was extracted once with a mixture of 
chloroform-isoamyl alcohol (24:1, v/v), and 1/10 vol of 10 X CTAB (10 % 
CTAB, 0,7 M NaCl) buffer was added to the aqueous phase, which was then 
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reextracted with an equal volume of chloroform isoamyl alcohol (24:1, v/v). 
The aqueous phase was transferred to a fresh microfuge tube and 1.5 vol of 
CTAB precipitation buffer (1% CTAB, 50 mM Tris-HCl pH 8, 10 mM 
EDTA) was added. The DNA was allowed to precipitate for 12 hr at 4 
5 degrees, and collected by centrifugation (5 min at 10 OOOg). The DNA was. 
resuspended in 100 nl of 10 mM Tris-HCl pH 7.5, 1 mM EDTA, 1 M NaCl, 
and 100 jig/ml RNase A and incubated at 50'C for 30 min. The DNA was 
precipitated by adding 2.2 vol of ethanol and incubating on ice for 20 min. 
The DNA was collected by centrifugation and the pellet was washed once 

10 with 1 ml of 70% ethanol, dried under vacuimi for 3 min and resuspended in 
lOiil ofdistOledwater. The DNA was stored at -20*C imtil use. 

-The 137 plants were grown to maturity and their seeds were 
. collected- individually. The fatty-acid composition of 10 individual seeds 
from each of the F2 plants was measured as described by-Browse- et-al .; 

15 (1986) in order to score the fad3 phenotype of each plant. Each seed was 
incubated in 1 ml of IN HCl in methanol for Ih at^O'C. The tubes were 
cooled to room temperature and 1 ml of 0.9 % NaCl plus 0.3 ml of hexane 
were added. The tubes were agitated by vortexing and the phases separated 
by centrifugation (300xg for 5 min). The hexane phase was saved, 

20 evaporated imder a stream of nitrogen, and the fatty acid methyl esters 
were dissolved in 50 pi hexane. An aliquot (2 pi) was injected onto the gas 
chromatograph and the fatty acid methyl esters separated and quantitated 
by flame ionization as described (Browse et al., 1986). 

The DNA samples (1 \ig) were then cut with the appropriate 

25 restriction enzyme (EcoRl for the marker # 220, Bgl2 for the marker 
ASA2) using a concentration of IXKGB buffer (Sambrook et al, 1989), 5 
units of the restriction endonuclease and 100 jig/ml BSA. The volume of 
each sample was 10 [il and the incubation was performed at 37 *C for 4 h. 
The fragments were resolved by agarose gel electrophoresis (0.8 % agarose 
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10 



in IX TAE buflfer; Sambrock et al., 1989) and transferred to nylon filters 
(hybond N+), using the alkaUne transfer method as described by the 
manufacturer. The nylon filters were probed (according to Church and 
Gilbert, 1984) with radioactively labeUed fi-agments of DNA (Sambrock et 
al., 1989) corresponding to known RPLP markers which had previously 
been mapped in the approximate vicinity of the fadS locus on chromosome 
2. The RFLP markers 220 (Chang et al 1988) and ASA2 were found to 
map close to the fad3 locus. Analysis of the pattern of recombinants 
(Table 1) indicated that both ASA2 and 220 were located on the same side 
of the fade locus at distances of 0.4 and 2.2 centimorgans (cM), 
respectively. 

Table 1 



15 



20 



# of olants 


220. 


ASA2 




67 


H 
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+/- 


30 


L 
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v- 


34 


N 


N 


■♦✓+ 
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N 


•fA- 
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L 


H 


■»/. 


1 


N 


H 


■»✓- 


1 


H 


H 





Table 1 shows the genotype of the F2 plants used for mapping 
the fad 3 locus. L is for Landsberg (background of the fad 3 mutant), N is 
for Niederzenz, H for heterozygous. A total of 137 F2 plants were analyzed. 
25 The number of recombinant plants between fad3 and 220 or ASA2 was 6 
and 1 respectively. 

In order to isolate the region of the chromosome containing the 
fad3 locus, the RFLP markers 220 and ASA2 were used as hybridization 
probes to screen several yeast artificial chromosome (YAC) Hbraries. (Grill 
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and Somerville, 1991; Ward and Jen, 1990). The YAC filters were prepared 
according to Grill and Somerville (1991). The library was replicated onto 
nylon filters disposed on petri dishes of SC — (synthetic complete medium 
minus tryptophan and uracil; Sherman et al., 1986). The cells were aUowed 
5 to grow for 12 h at 30*C, and the filters were transferred for 15 min on a 
Whatman 3MM paper saturated with 1 M sorbitol, 50 mM DTT, 50 mM 
EDTA(pH8). 

The cell wall of the cells was then digested with Ijrticase, by 
incubating the filters on a WhatmEUi paper saturated with IM sorbitol, 50 
10 mM EDTA and 2 mg/ml lyticase (Sigma Co., St. Louis,MO) for 12 h at 
30*G. — The filters were then transferred on a Whatman 3MM paper 
saturated with 0.5 M NaOH, 1.5 M NaCl for 15 min, neutralized with 0,5 M 
Tris-HCl pH 8 for 15 min and quickly rinsed in 2XSSe (SSC is lOmM 
sodium, catrate, 150mM NaGl, pH 7). The filters were allowed to dry, ^ and 
15 were transferred to -a vacuum oven at 80*e for 1 h. They were 
subsequently hybridized according to Church and Gilbert (1984), with 
probes labelled with 32p according to Sambrook et al. (1989). 

The DNA of RFLP probe 220 was prepared fi-om 100 ml of liqtiid 
culture lysate using the lambdasorb procedttre (Promega Corp., Madison, 
20 WI); the cDNA encoding ASA2 was excised fi-om the original plasmid 
(pKNl40C; obtained from Dr. G. Fink, Whitehead Institute, Cambridge, 
MA) with Hinds and cloned into the ffindS site of pBLUESCRIPT, The 
plasmid DNA was then purified by Cesiimi chloride gradients according to 
Sambrook et al (1989), digested with HindS and the DNA insert was gel 
25 purified twice by electroelution according to Sambrook et al (19iB9). 

In order to probe the libraries, the whole DNA from RFLP220 
was used as a hybridization probe. By contrast, only the DNA insert of 
ASA2 was used as a probe. The RFLP probe 220 hybridized to YAC 
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EG4E8 and EG9D12. The probe ASA? hybridized to YACs EW15G1, 
EW15B4 and EW7D11. 

In order to determine if these YACs contained all of the DNA 
between RFLP220 and ASA2, small regions of DNA from the ends of the 
5 inserts in EG4E8 and EW15G1 were prepared by inverse PGR (Grill and 
Somerville, 1991). For that purpose, DNA was prepared from the 
appropriate YAC clones. The clones (single colonies) were grown to 
saturation in SO — liquid cultures, and 1 ml of these cultiu'es was used to 
inoculate 40 ml liquid cultures (in SO — mediimi) that were allowed to grow 

10 for 16 h at 30*C. The cells were collected by centriiugation, washed once in 
1 M sorbitol, 50 mM EDTA, resuspended in 200 ^l of 1 M sorbitol, 50 mM 
EDTA, 100 mM sodiiun citrate pH 5.8, 2 mM P-mercaptoethanol and 2 
mg/ml lyticase, and incubated 2 h at 30 *C. 

Next, 350 ^1 of 2XCTAB buffer was added and the DNA was 

15 purified as described above. DNA (5 ^g) of each clone was digested 
separately with HincU, Alul, EcoRV and Rsal (in IXKGB buffer, at 37 'C 
for 4 h; final volume: 50 pi). The reactions were stopped by heating at 65 
*C for 15 min, extracted once with one volume of phenol saturated with TE 
pH 8, followed by an extraction with 1 volume of chloroform - isoamyl 

20 alcohol mixture (24:1, vol/vol). The DNA was recovered by ethanol 
precipitation and resuspended in sterile distilled water. The ligation 
reactions were performed using 300 ng of DNA in a final volume of 50 \iL 
The reactions were carried out in 50 mM Tris-HCl pH 7.4, 10 mM MgC12, 1 
mM DTT,1.2 mM ATP with 1 U of Ugase, for 2 h at 20 'C, and stopped by 

25 heating at 68 'C for 30 min. 

The PGR reactions were carried out as follows: The buffers used 
were the ones indicated by the suppliers except for the Perkin Elmer 
enzyme for which the reaction was supplemented with an additional 1.4 
mM MgCl2 (final concentration 2.9 mM Mg). The dNTP final concentration 
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was 125 when the Perkin Ehner enzyme was used and 200 with the 
Taq polymerases from other sources. In all cases, 100 ng of each 
oligonucleotide was used. The final volume was 100 pi. When no product 
was obtained, the reactions were carried out again in the same conditions 
5 except that formamide was added to a final concentration of 3 %. 

The left end was amplified fi*om the ligation products of the 
EcoRV and Rsal digests, using the oligonucleotides EGl 
(GGCGATGCTGTCGGAATGGACGATA) (SEQ, ID NO. 3) and EG2 
(CTTGGAGCCACTATCGACTACGCGATC) (SEQ, ID NO. 4). 
10 The right end of the clones obtained from the EG library was 

amplified from the ligation products of the Aliiland HincII digests, using., 
the oUgonucleotides EG3 (CCGATCTCAAGATTACGGAAT) (SEQ. ID NO. 
. 5) and EG4 (TTCCTAATGCAGGAGTCGCATAAG) (SEQ. ID NO. 6). 

- The right end-df the clones obtained firom the EW YAC library 
15 was^ampjified using the oligonucleotides HI (AGGA6TGGCATAAGGGAG) 
, (SEQ, m NO. 7) and H2, (GGGAAGTGAATGGAGAC) (SEQ. ID NO. 8), 
using the same cycle conditions as above, except that the annealing 
temperature was reduced to 50 *C. 

After the reactions were completed, 5\il of each mixture were 
20 electrophoresed on an agarose gel to separate the amplification product 
fi'om primers. The slice of agarose that contained the amplified band was 
. excised firom the gel and melted in 1 ml of distilled water. Large amotmts of 
product could then be produced, by reamplification of 5 |ji of the melted 
slice. The PGR products were then purified by electroelution or by using 
25 GeneClean (Bio 101) and used as hybridization probes to probe filters 
containing the isolated YAC DNA restricted by several enzjrmes. The 
probe made from the right end of EW15G1 hybridized to EG4E8 and 
similarly, a probe from the right end of EG4E8 hybridized to EW15G1. 



SUBSTITUTE SHEET (RULE 26) 



wo 94/18337 PCT/US94/01321 



.22- 



Thus, it was concluded that the YACs EG4E8 and EW15G1 contained all of 
the DNA in the region of the chromosome between RFLP220 and ASA2. 

The size of the YAC clones was estimated by field inversion 
electrophoresis (CHEF, Vollrath and Davis, 1987), High molecular weight 
5 DNA was prepared as follows: the yeast cells which contained the YAC 
clones were grown and treated with lyticase as for preparing DNA as 
described above. The spheroplasts were then resuspended in an equal 
volume of IM sorbitol, 50 mM EDTA, 1 % low melt agarose at 37'C. The 
mixture was poured in a mould (Biorad) which was set on ice to allow the 

10 agarose to harden. 

The resulting plugs were incubated for 12 h in 0.5 M EDTA pH 9, 
1%^ lauryl sarcosine 1 mg/ml Proteinase K at 50*C. The plugs were 
subsequently washed twice in 50 mM EDTA and stored at 4*0 until use. 
The CHEF gel was run in IXTBE for 16 h at 200 V, with a switching 

15 interval of 20 s; the temperature of the buffer was maintained at 14 *C 
during the run. The sizes of the YACs were determined by comparison with 
a lambda ladder and the yeast chromosomes, and were as follows: EG4E8, 
90 kb; EG9D12, 190 kb; EW15G1, 90 kb; EW15B4, 70 kb, EW7D11, 125 
kb. These sizes permitted us to roughly determine a correspondence 

20 between physical and genetic distances: the distance that separates 220 
from ASA2 cannot exceed 180 kb, the sum of the size of the 2 YACs 
EG4E8 and EW15G1. Since the corresponding genetic distance is 1.7 cM, 
one can roughly estimate that, in this particular cross and in this particular 
region of the genome, the value of 1 cM is close to lOOkb. Thus, since the 

25 fads gene maps only 0.4 cM away from ASA2, the corresponding physical 
distance should be close to 40 kb. We then concluded that fadS was 
probably located on the YAC EW7D11, which is the largest YAC 
hybridizing with ASA2. See Figiire 1. 
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In order to test the possibility that the YAC EW7D11 carried 
the fads gene, the YAC was used to probe a cDNA library made from 
developing seeds of Canola (Brassica napus L.). Even though the YAC was 
isolated from Arabidopsis, the fact that Arabidopsis and B. napus are botii 
5 members of the family Cruciferae led us to predict that the homologous 
genes from these two species would be sufQciently identical at the 
nucleotide sequence level so that the Arabidopsis gene woiild hybridize to 
the B. napus gene. We also assumed that, because it catalyzes a 
chemically similar reaction to the stearoyl-ACP desaturase, it would be 

10 expressed at similar moderately high levels in developing seeds (Shanklin 
and Somerville, 1991). Since EW7D11 contained only about 0.2% of the 
total genome, we expected it to contain only about 2 moderately 
abundantly expressed genes (i.e., genes in which the mRNA is between 0.1 
arid 0.01% of total mRNA). : ^ 

15 " DNA of YAC EW7D II was isolated as follows: high moled^^ 

weight DNA was prepared from the yeast cells that contained the YAC 
EW7D11 as described above, and several preparative low-melt agarose 
CHEF gels were rim in IXTBC buffer (same as TBE except that CDTA 
was substituted for EDTA). The sUces that contained the YAC were excised 

20 from the gels and pooled. Three slices were melted at 65*C and extracted 
with an equal volimie of phenol saturated with TE. The aqueous phase was 
saved and reduced to 0.5 ml by repeated extractions with isobutyl alcohol. 
The remaining agarose was removed by several phenol extractions, followed 
by two chloroform-isoamyl alcohol extractions. The DNA was precipitated 

25 by adding 2 ^g of linear acrylamide as a carrier plus 10 ^il of 5M NaCl and 
1.1 ml of ethanol, and incubating 20 min at 0 'C. The DNA pellet was 
recovered by centrifugation, washed in 70% ethanol, dried under vacuvun 
and resuspended in 50 jil of distilled water. The DNA (50 ng) was 
radioactively labelled and used to probe a cDNA library in Xgtll. 
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The nitrocellulose filters were processed as described in 
Sambrook et al (1989). Duplicate filters were used, and the films were 
exposed 5-7 days in order to obtain a good signal. From among 200,000 
plaques screened in this way, 31 hybridized to EW7D11. Among these 31 
5 clones, 17 were homologous to each other, as checked by cross 
hybridization in stringent conditions. The size of the inserts in the 17 clones 
was estimated and the clone with the largest cDNA was retained for 
fiirther analysis. A small scale preparation of this phage was prepared 
vising the lambdasorb method, and the insert was excised by restricting 

10 with EcoRl. This insert was ligated into a pBLUESCRIPT II vector 
linearized with EcoRI, and the ligation mixture was used to transform E. 
coli strain DHSo. 

One of the recombinant clones was designated pBNDESS 
(Figure 2), and retained for sequencing. ~ The sequence was determined on 

15 both strands, using the sequenase enzyme, (US Biochemicals, Cleveland, 
OH) according to the instructions provided by the suppUer. The nucleotide 
sequence of the insert in pBNDESS is presented as Figure 3. The deduced 
amino acid sequence of the largest open reading frame in the nucleotide 
sequence is also shown in Figure 3. 

20 Comparison of the deduced amino acid sequence of the 383 

amino acid open reading frame in clone pBNDESS against the known 
sequences in GenBank release 70 was performed using the FASTA 
program (Lipman and Pearson, 1985). This analysis revealed that the 
sequence from pBNDESS had a region of significant homology to a 

25 previously characterized desaturase gene from the cyanobacterium 
Synechocystis (Figure 4). (Wada et al. 1990). This was considered 
suggestive evidence that the clone pBNDESS encoded a desaturase which 
was probably the fadS structural coding sequence product. This was 
subsequently confirmed by a genetic complementation experiment. 
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The cDNA was cloned into plant transformation vector pBI121 
(Figure 5) under the control of the CaMV35S promoter to construct 
pTiDESS (Figure 6). Plasmid pTiDESS was introduced into an 
Agrobacterium tumefaciens istrain which also carried an Ri plasmid and this 
5 was used to produce transgenic rooty tumors from both wild type 
Arabidopsis and the fadS mutant. Transgenic tissue was selected for 
antibiotic resistance to confirm the presence of the pTiDESS. Fatty acid 
methyl esters were then prepared and examined by gas chromatography to 
determine the profile of fatty adds being produced in the tissue. The levels 

10 of linolenic acid increased, demonstrating that the cDNA on pTiDESS can 
complement the fadS mutetion. These results, which are described in detail 
in Example I below, confirm the identity of the cDNA as encoding a linoleic 
^.add desaturase. 

The isolation of a plant structural coding sequence provides 

-15 those skilled in the art with a tool for the manipulation of gene expression 
by the mechanism of antisense RNA. The^technique of antisense KNA is 
based upon introduction of a chimeric gene which will produce an RNA 
transcript that is complementary to a target gene (reviewed in Bird and 
Ray, 1991). The resulting phenot3rpe is a reduction in the gene product 

20 from the endogenous gene. The portion of the gene which is suflBcient for 
achieving the antisense effect is variable in that numerous fragments or 
combinations thereof are likely to be effective. Various portions of the 
structural coding sequence of linoleic acid desaturase isolated either from 
cDNA or genomic clones are likely capable of redudng linolenic add levels in 

25 plants by reduction in levels of linoleic acid desaturase levels. An example 
of using an antisense oriented linoleic acid desaturase structviral coding 
sequence is set out in Example 2. 
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PolY^de^Y][ation Sign^ 

The 3' non-translated region of the double stranded DNA 
molecule of the present invention contains a region that functions in plant 
cells to promote polyadenylation to the 3' end of the RNA sequence. Any 
5 such regions can be used within the scope of the present invention. 
Examples of suitable 3* regions are (1) the 3* transcribed, non-translated 
regions containing the polyadenylated signal of Agrobacterium tumor- 
indudng (Ti) plasmid genes, such as the nopaline synthase (NOS) gene, and 
(2) 3' regions of plant genes like the soybean storage protein genes and the 
10 small subunit of the ribulose-l,5-bisphosphate carboxylase (ssRUBISCO) 
gene. An example of a preferred 3* region is that from the NOS gene, 
described in greater detail in the examples below. 

Flftnt Tran^fQrmatiiQP/RgggT^CTatiQP 

Any plant which can be transformed to contain the double- 

15 stranded DNA molecule of the present invention are included within the 
scope of this invention. Preferred plants which can be made to have 
increased or decreased linolenic acid content by practice of the present 
invention include, but are not limited to sunflower, safflower, cotton, com, 
wheat, rice, peanut, canola/oilseed rape, barley, sorghum, soybean, flax, 

20 tomato, almond, cashew and walnut. 

A double-stranded DNA molecule of the present invention 
containing the functional plant linoleic add desatiirase gene can be inserted 
into the genome of a plant by any suitable method. Suitable plant 
transformation vectors include those derived from a Ti plasmid of 

25 Agrobacterium tumefacienSy as well as those disclosed, e.g., by Herrera- 
Estrella (1983), Bevan (1984), Klee (1985) and EPO publication 120,516 
(Schilperoort et al.). In addition to plant transformation vectors derived 
from the Ti or root-inducing (Ri) plasmids of Agrobacterium ^ alternative 
methods can be used to insert the DNA constructs of this invention into 
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plant cells. Such methods can involve^ for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free DNA 
delivery via microprojectile bombardment, and transformation using 
bacteria, viruses or pollen. 
5 A plasmid expression vector, suitable for the expression of the 

linoleic acid desaturase gene in monocots is composed of the following: a 
promoter that is specific or enhanced for expression in the lipid storage 
tissues and a 3' polyadenylation sequence such as the nopaline synthase 3' 
sequence (NOS 3'; Fraley et al., 1983). This expression cassette may be 

10 assembled on high copy replicons suitable for the production of large 

quantities of DNA 

A particularly iiseful:i4^o6actermm-based plant transform 
vector for use in transformation of dicotyledonous plants is plasmid vector 
pMONSSO (Rogers, S.G.,. 1987). Plasmid pMON530 (see Figure 7) is a 

15 derivative of pMONSQS prepared by transferring the 2.3 kb Stul-Hindlll 
fragment of pMON316 (Rogers, S.G., 1987) into pMON526.- Plasmid 
pMON526 is a simple derivative of pMON505 in which the Smal site is 
removed by digestion with Xmal, treatment with Klenow pol3rmerase and 
ligation. Plasmid pMON530 retains all the properties of pMON505 and the 

20 CaMV35S-NOS expression cassette and now contains a xmique cleavage 
site for Smal between the promoter and polyadenylation signal. 

Vector pMON505 is a derivative of pMON200 (Rogers, S.G., 
1987) in which the Ti plasmid homology region, LIH, has been replaced with 
a 3.8 kb Hindlll to Smal segment of the mini RK2 plasmid, pTJS75 

25 (Schmidhauser & Helinski, 1985). This segment contains the RK2 origin of 
replication, oriV, and the origin of transfer, oriT, for conjugation into 
Agrobacterium using the tri-parental mating procedure (Horsch & Klee, 
1986). Plasmid pMON505 retains all the important features of pMON200 
including the synthetic multi-linker for insertion of desired DNA fragments, 
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the chimeric NOS/NPTIF/NOS gene for kanamycin resistance in plant 
cells, the spectinomycin/streptomycin resistance determinant for selection 
in E. coli and A. tumefaciensy an intact nopaline S3aithase gene for facile 
scoring of transformants and inheritance in progeny and a pBR322 origin of 
5 replication for ease in making large amounts of the vector in E. coli. 
Plasmid pMON505 contains a single T-DNA border derived from the right 
end of the pTiT37 nopaiine-type T-DNA Southern analyses have shown 
that plasmid pMON505 and any DNA that it carries are integrated into the 
plant genome, that is, the entire plasmid is the T-DNA that is inserted into 

10 the plant genome. One end of the integrated DNA is located between the 
right border sequence and the nopaline S3mthase gene and the other end is 
between the border sequence and the pBR322 sequences. 

When adequate numbers of cells (or protoplasts) contaixiing the 
linoleic acid desatiurase gene are obtained, the cells (or protoplasts) are 

15 regenerated into whole plants. Choice of methodology for the regeneration 
step is not critical, with suitable protocols being available for hosts from 
Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery, 
parsnip), Cruciferae (cabbage, radish, rapeseed, etc.)» Cucurbitaceae 
(melons and cucimiber), Gramineae (wheat, rice, com, etc.), Solanaceae 

20 (potato, tobacco, tomato, peppers) and various floral crops. See, e.g., 
Ammirato (1984); Shimamoto, -1989; Fromm, 1990; Vasil and Vasil, 1990. 
Uses of Linoleic Acid Desaturase 

The present invention can be used for any modification (either 
increase, decrease, or mere change) of the oil content of a plant or plant 

25 tissue. Linolenic acid is an important constituent of several membranes in 
plant cells. 

One preferred method is to modify the oil content of the plant to 
improve the plant's temperature sensitivity. For instance, plants deficient 
in linolenic acid display reduced fitness at low temperature (Hugly and 
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Somerville, 1992). Also, increased linoleic add content in vegetative tissues 
has been implicated as a factor in freezing tolerance in higher plants 
(Steponkus et al., 1990 and references therein). In a preferred 
embodiment, egression of the linoleic acid desaturase structural coding 
5 sequence can result in the genetic modification of higher plants to achieve 
tolerance to low environmental temperatures. Transfonnation with 
pTiDESS demonstrates that linolenic acid levels can be increased by 
expression of this gene in a constitutive manner. Chilling or freezing ixyury 
in crops may be overcome by expression of this gene in vegetative or 

10 reproductive tissues by employing an appropriate promoter. 

Linolenic acid, a pol3mnsaturated fatty add, is also extensively^ 
used in the paint and varnish industry in view of its rapid oxidation. Flax 
seed is a predominant source of this oil. Higher quantities of this fatty add 
_ in rapese^d or soybean will provide opportunities for using vegetable-oils 

15 from these sources as a replacement for Unseed (fl Expression of a ; 

linoleic acid desaturase structural coding sequence in seed tissue can result 
in a higher proportion of linolenic add in the storage oil. 

Linolenic acid is further a precursor in the bios3mthesis of 
jasmonic acid, an important plant growth regulator. Linolenic acid is 

20 converted to jasmonic acid by introduction of an o^gen to the carbon chain 
by a lipoxygenase, followed by dehydration, reduction, and several P- 
oxidations (Vick and Zimmerman, 1984). The activity of jasmonic add has 
been measured in terms of induction of pathogen defense responses. By 
application of free linolenic acid to plants, plant pathogen defenses can also 

25 be induced (Farmer and Ryan, 1992). A model has been proposed to explain 
the ability of free linolenic acid to exhibit the effects associated with 
jasmonic acid (Farmer and Ryan, 1992). It is hjrpothesized that all of the 
enzjrmatic activities which are required for the conversion of Unolenic add 
to jasmonic acid are constitutively present in the cell and the rate Umiting 
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step in the production of jasmonic acid is the availability of free linolenic 
acid. A likely route for the production of the free linolenic acid is by the 
activity of a lipase in the plasma membrane. 

It further has been observed that exogenous jasmonic acid can 
5 more powerfully activate defense responses than can woimding. This 
suggests that wounds cannot generate enough free linolenic acid to support 
high level production of jasmonic acid. The activity of the lipase or the 
availability of appropriate substrate for the lipase may be rate limiting 
upon wounding. By increasing levels of available substrate, increasing 

10 linolenic acid levels in the plasma membrane, it should be possible to 
enhance a plant's ability to respond to pathogens by allowing for a higher 
production of jasmonic acid. Expression of a linoleic acid desaturase 
structural coding sequence can result in a higher molar percent Unolenic 
acid in the plasma membrane of a plant cell therefore enhancing the 

15 jasmonic acid signaling pathway. It is our intent to evaluate plants 
containing high linolenic acid levels in root and foliar tissues for their 
pathogen resistance. 

It is also xmdesirable to have significant levels of linolenic acid in 
cooking oils. Linolenic add is imstable during cookihg and is rapidly oxidized. 

20 The oxidized products impart rancidity to the finished product. Rapeseed or 
soybean oil containing less than about 3%, and preferably 2% or less of 
linolenic add is ideal for use as a cooking oil. By expression of the antisense 
of the structural coding sequence for linoleic acid desaturase, it is possible 
to reduce the linolenic acid content of these oils. 

25 All higher plants have linolenic add and, therefore, contain genes 

for linoleic acid desaturases. Because of the many examples in which genes 
isolated from one plant species have been used to isolate the homologous 
genes from other plant species, it is apparent to any one skilled in the art, 
that the results presented here do not only pertain to the use of the B. 
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napus fads gene, or to the use of the gene to modify fatty acid composition 
in B. napus. Obviously, the linoleic add desaturases from many organisms 
could be used to increase linolenic acid biosynthesis and accumulation in 
plants and enzymes from any other higher plant or algae can serve as 
5 sources for linoleic acid desaturase genes. For example, since a YAC 
containing the Arabidopsis gene was used to isolate the £. napus gene, it is 
apparent that the insert in pBNDESS could be used as a probe of genomic 
libraries for isolation of the corresponding full length genes from other plant 
species. It is also likely that the information contained in the sequence of 

10 this gene will be iiseful to done other lipid desaturases genes. 

Expression of a linoleic add desaturiase in a sense orientation 
may also allow for the isolation of plants with reduced leviels of linolenic 
add. This could be accomplished by the mechanism of co-suppression (Bird 
and Ray, - 1991). The molecular mechanism of co-suppression is" ^t this 
-15 time poorly imderstood Hut occurs when plants are transformed with a gene 
that is identical or highly homologous to an allele foimd in the plants 
genome. There are several examples where expression of a chimeric gene in 
plants can result in a reduction of the gene product from both the chimeric 
gene and the endogenous gene(s). Those skilled in the art will recognize that 

20 the resulting decrease in linolenic add would be a direct result of expression 
of the linoleic acid desaturase structural coding sequence and would be 
correlated to the hnoleic add desaturase activity in the transformed plant. 

Linolenic acid levels in plant cells can also be modified by 
isolating genes encoding transcription factors which interact with the 

25 upstream reg^ulatory elements of the plant linoleic add desatux^ase gene(s). 
Enhanced expression of these transcription factors in plant cells can effect 
the expression of the hnoleic acid desaturase gene. Under these conditions, 
the increased or decreased hnolenic add content would also be caused by a 
corresponding increase or decrease in the activity of the linoleic add 
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desaturase enzyme although the mechanism is different. Methods for the 
isolation of transcription factors have been described (Katagiri, 1989). 

The following examples are provided to better elucidate the 
practice of the present invention and should not be interpreted in any way 
5 to limit the scope of the present invention. Those skilled in the art will 
recognize that various modifications, tnmcations, etc. can be made to the 
methods and genes described herein while not departing fi-om the spirit and 
scope of the present invention. 
Example 1 

10 Expression of fad3 gene tp increase linolenic add 

To verify the assimiption that the cDNA insert in pBNDESS 
encodes a linoleic acid desaturase, both wild type and fad3 mutation 
Arabidopsis were transformed to contain the cDNA insert. In order to 
express the linoleic acid desaturase structural coding sequence (hereafter 

15 referred to as the 'YadS gene**) in plant cells, the plasmid pBNDES3 was 
digested with Xhol and the ends were filled in with the Klenow fragment of 
DNA polymerase (Sambrook et al 1989). The cDNA insert was 
subsequently excised by digestion with Sacl and ligated into the Sacl and 
Smal sites of the binary Ti plasmid vector pBI121 (Clontech 

20 Laboratories), thereby replacing the GUS reading frame. The ligation 
reaction was carried out in 20 ^il for 12 h at 16 using 100 ng of both 
insert and vector, and one unit of T4 DNA ligase. The ligation mixtxire was 
used to transform competent DH5a E, coli cells (prepared by the calcium 
chloride method, according to Sambrook et al, 1989), and transformants 

25 were selected on L-broth plates that contained 50 ^ig/^l Kanamycin, 
Alkaline minipreparations of recombinant clones were analyzed for the 
correct restriction pattern. One of these plasmids, designated pTiDES3, 
was used for further experiments. 
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This plasznid was electroporated (according to Mersereau and 
Pazour, 1990) into Agrobacterium tumefaciens strain RIOOO which carries 
an Ri plasmid. The transformed bacteria were selected on kanamycin LB 
plates for 2 days at 30 *C. DNA minipreparations of several recombinant 
5 bacteria were performed and analyzed as described above to verify the 
presence of the construct. 

Yoimg flowering stems of wild type and the fad3 mutant of 
Arabidopsis were sterilized for 30 min in 10% commercial bleach, 0.02% 
Triton XlOO, and 2-cm ezplants that contained the flowering stem were 
10 infected with RIOOO (pTiDES3) This was performed by dipping the 
sectioned extremity in a drop of an overnight culture of the appropriate 
Agrdbitcterium - that was grown from a single colony in LB medium 
supplemented with 50 u^nl Kanamydnv * 

The-infected stems were cultured for two days on-solid MSO 
* 15 medium (Gibco MS salts plus Gamborg B5 vitamins, 3% sucrose and 0.8% 
- agar). At this time the stem segments were transferred for 5 weeks to 
MSO medium containing 200 ^g/ml cefotaxime to kill the bacterium. After 
approximately two weeks, most of the stem explants had developed rooty 
ttimors resulting from transfer of parts of the Ri plasmid into cells of the 
20 stem explants. In order to identify the rooty ttunors which had also 
received the binary Ti plasmid pTiDES3, approximately 24 rooty tumors 
from each treatment were transferred to MSO medivun containing 50 pg/ml 
of kanamycin to select for the growth of those roots which had been 
cotransformed with the binary Ti plasmid; the medium contained also 200 
25 \xg/ml of cefotaxime to inhibit bacterial growth. Following a further period of 
growth for 2 weeks, fatty acid methyl esters were prepared (as described 
above) from the roots for analysis by gas chromatography. The results of 
these analyses are presented in Table 2. 
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Table2. Genotvnfi 



mol% wildtype fad 3 wildtype fadS 



5 



Fatty add 


pBI121 


pBI121 


pTiD£S3 


pTiDESS 


16:0 


22.0±2.9 


21.2±1.6 


21.1±0.9 


21.3±2.3 


16:1 


2.5±0.7 


1.6±0.8 


2.0±0.1 


1.5±0.2 


18:0 


2.3±1.9 


2.3+1.9 


1.910.2 


1.6±0.4 


18:1 


3.8±1.3 


5.9±2.6 


7.7±2.0 


9.1±2.0 


18:2 


37.3±3.7 


62.2±5.9 


15.7±11.7 


24.4±14.9 


18:3 


31.9±4.5 


6.7±0.7 


51.3±10.9 


42.1±15.5 



Table 2 shows the fatty acid composition of transgenic roots. 
The transgenic roots resulting from infection of wild type or the fad3 
mutant with A tumefaciens RIOOO carrying the vector (pBI121) or the 

15 plasmid pTiDESS were grown in the presence of kanamycin (50 g/ml) for 
three weeks to identify the roots which had been cotransformed with one of 
these plasmids. The fatty acid composition of the roots was determined as 
previously described (Browse et al., 1986). The abbreviations used in Table 
2 are as follows: 16:0, palmitic add; 16:1, palmitoleic add; 18:0, stearic add; 

20 18:1, oleic add; 18:2, Hnoleic add; 18:3, linolenic add. The values presented 
are the mean ± SD (n=12). 

From these results it can be seen that the production of rooty 
tumors containing pBI121 on wild tjTpe Arabidopsis or the fadS mutant had 
no effect on the fatty acid composition over non-pBI121 containing wild 

25 type Arabidopsis or fadS mutant. By contrast, transformation of the fadS 
mutant with the plasmid pTiDESS resulted in large increases in the 
content of linolenic acid. In contrast to the linolenic acid content of 6.7 +/- 
0.7% in the fadS mutant transformed with pBI121, the presence of 
pTiDESS resulted in accumulation of 42.1% of the fatty adds as linolenic 

30 acid. The increased content of linolenic acid was accompanied by a 
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decrease of corresponding magnitude in the content of linoleic add. Thus, it 
is clear that the fadS gene encodes a Unoleic acid desaturase. Introduction 
of the fadS gene into wild type tissues also resulted in significantly 
increased accumulation of linolenic acid and a corresponding decrease in 
5 linoleic acid (Table 2). Thiis, it is apparent from these results that the 
linoleic acid content of plant tissues can be increased by high level 
expression of a linoleic add desatiu*ase. In the present embodiment, the 
£ad3 gene was placed under transcriptional control of the constitutive high 
level CaMV 35S promoter carried on pBI121. The implication from these 
10 results is that expression from this promoter raised the level of expression 
of the fads gene to. levels higher than are normally achieved by expriession 
: from the endogenotiis fads promoter. The'restdts presented here indicate 
that the fadS gene has significant utility in genetic modification of higher 
plants to elevate linolenic add=^levels. 

15 Example 2 ' - ^ ' ' - - - . 

Antisense expression of fadS gene to decrease hnolenic add levels 

In order to decrease the linoleic acid desaturase activity by 
genetic engineering methodology, the cDNAinsert of pBNDESS was cloned 
into plant expression cassettes in an antisense orientation. A 959bp Bglll 

20 restriction fragment of pBNDESS was used in the antisense expression 
vectors. The fragment is from 152 nucleotides downstream of the initiating 
methionine codon of the cDNA to a second Bglll restriction site that is 
located near the C- terminus of the coding region. 189 nucleotides of the 
coding region are excluded from this fragment. Triple ligations were 

25 performed with the fadS gene fragment to construct two separate plant 
expression cassettes. 

A seed specific expression cassette was constructed by insertion 
of the Bgin fragment of pBNDESS in an antisense orientation behind the 
soybean promoter for the a' subunit of P-conglycinin (7S promoter). A 
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975bp Hindin to Bglll fragment containing the 7S promoter derived from 
pMON529 was prepared by digesting with Bglll for 30min at 37 followed 
by addition of Calf Intestinal Alkaline Phosphatase (CIAP) (Boehringer 
Mannheim). The reaction was allowed to proceed for 20min followed by 
5 purification of the linearized DNA using the GeneClean (Bio 101) 
purification system. The DNA was then digested with Hindlll. A fragment 
derived from pMON999 containing the Nopaline synthase 3' region and the 
pUC vector backbone was prepared by digestion with BamHI and 
treatment with CIAP. The DNA was purified by the GeneClean procedure 
10 and digested with HindlU. The fragment of pBNDESS was prepared by 
digestion with BglU. The three fragments were purified by agarose gel 
electrophoresis and the GreneClean procedure. 50 to 200ng of the purified 
fragments were ligated for one hour at room temperature followed by 
transformation into the E. coZi strain JMIOI. Resulting transformant 
15 colonies were used for plasmid preparation and restriction digestion 
analysis. Double digestion with Bglll and Ncol was used to screen for 
transformants containing the fadS gene in an antisense orientation. One 
clone was designated as correct and named pMONl3801. 

A second expression cassette was constructed to allow for 
20 constitutive expression of the antisense message in plants. A fragment 
containing the enhanced 35 S promoter was prepared from pMON999 by 
restriction digestion with Hindlll and Bglll followed by treatment with 
CIAP as above. The correct sized fragment was obtained by agarose gel 
electrophoresis and the GeneClean procedure. The Bglll to Hindlll vector 
25 fragment and the Bglll fragment of pBNDES3 which were purified above 
were used in this construction. Ligation, transformation and screening of 
clones were as described above. One clone was designated as correct and 
named pMONl3802. 
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In both pMONlSSOl and pMONl3802, the promoter, fadS gene 
and the Nos 3' region can be isolated on a NotI restriction fragment. These 
fragments can then be inserted into a unique NotI site of the vector 
pMON17227 to construct glyphosate selectable plant transformation 
5 vectors. The vector DNA is prepared by digestion with NotI followed by 
treatment with CIAP. The fadS containing fragments are prepared by 
digestion with NotI, agarose gel electrophoresis and purification with 
GeneClean. Ligations are performed with approximately lOOng of vector 
and 200ng of insert DNA for 1.5 hours at room temperature. Following 

10 transformation into the E. coli strain LE392, transformants were screen 
by restriction digestion to identify clones containing the fad3 expression 
cassettes. Clones in which transcription from the fadS cassette is in the 
same-direction as transcription- from the selectable marker werie designated^ 
as correct and named pMON13804 (FMV/CP4/E9, 7S/anti fadS/NOS) " 

15 (Figxu-e 8) and pMONl3805 (FMV/CP4/E9, E35S/anti fadS/NOS) (Figufe 

In preparation for transforming canola cells, pMON13804 and 
pMONl3805 were mated into Agrobacteritun ABI by a triparental mating 
with the helper plaismid pRK2013. 

20 Seeds from the plants produced by transformation were 

analyzed for alterations in fatty acid profile. Fatty acid methyl esters 
(FAMES) were prepared from seed tissue and analyzed by capillary gas. 
chromatography (Browse et al, 1986). For initial screening of plants, six 
seeds were pooled together from an individual plant. The seeds were 

25 crushed and FAMES extracts were made. Control plants, plants 
transformed with the selectable marker only (pMON17227), were also 
analyzed using the identical procedure. From the initial screen on pooled 
seed samples, several lines were identified which displayed a decreased level 
of linolenic acid. Lines with decreased levels of linolenic acid were 
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reanalyzed by determining fatty add profiles from individual seeds. Four to 
twenty individual seed were analyzed from candidate lines and from 
selected control plants. The resiilts of the FAMES analysis is summarized 
in Figure 9. 

5 Figure 9 shows the levels of fatty acids expressed in molar 

percent of twenty individual seed of the transgenic line 13804-51 as 
compared to control seed. Panel A discloses oleic acid, panel B discloses 
linoleic add and panel C disdoses linolenic add. 

The data in Figure 9 demonstrate that antisense expression of a 
10 linoleic add desaturase has significantly altered the fatty add profile of the 
resulting seed tissue. The percent of linolenic add has been reduced to a 
little over 2% of the total fatty add in the seed tissue. The percent of 
linoleic add has been reduced slightly and surprisingly, the percent of oleic 
acid in the seed has been increased to approximately 70%, This 
15 demonstrates the appUcabiUty of utilizing the fedS gene to manipulate the 
fatty add profile of crop plants. 

In order to demonstrate that the alteration in the fatty acid 
profile of the FAMES extracted fix)m total seed tissue would be reflected in 
the seed oil fraction, triglycerides from seeds of fadS antisense plants were 
20 characterized. Total lipid extracts were made by pooling ten seeds and 
grinding in 2ml of methanol:chloroform:water (4:2:1). The homogenate was 
allowed to stand for 20min and then debris was pelleted and discarded. To 
the supernatant 400^1 of chloroform rmethanol (2:1), 640jil of chloroform 
and 740^1 of water was added and vortexed. Phases were separated by 
25 centrifugation and the chloroform phase was recovered and dried imder 
nitrogen. Samples were resuspended in lOOjil of chloroform and lO^il was 
applied to silica gel G thin layer chromatography plates for separation. 
Two identical plates were prepared with one being charred after 
development to allow for alignment and location of spots to be analyzed on 
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the other plate. Plates were developed three times in petroleum 
ether:diethyl ether:acetic add (90:10:1). One plate was sprayed with 50% 
sulfuric add and heated in an oven at 90*C to allow for detection of lipids. 
Triglyceride fractions were identified as comigrating on the plate with 
5 purchased lipid standards (Sigma Chemical Co, cat #178-13). The charred 
plate was aligned with the identical plate and the triglyceride fractions were 
scraped from the plate. The fatty acids were transesterified to produce 
FAMES extracts for GC analysis by the same procedure as above. The 
fatty acid profiles of the triglyceride fractions are shown in Table 3 and 
10 demonstrate that this fraction have decreased linolenic add. 





Transgenic 


' Mol% 






15 


_ - line — 




18:2 . 


-.. 18:3 - 




" 17227-10 


.44 


30 


15.3 




17227-493 


65 


17 


6.9 




13804-47 


58 


21 


4.3 


20 


13804-50 


67 


20 


2.8 




13804-76 


59 


19 


5.0 




13804-117 


62 


21 


4.0 



Table 3 compares the fatty acid molar percentages of 
25 triglyceride fractions from control and transgenic lines. These above 
results provide clear evidence that the fadS gene can be vised to decrease 
the levels of hnolenic add in the storage oil of plants. The gene provides a 
tool for the manipulation of the fatty acid profile of seed storage oil to 
improve the products derived from the oil. 
30 A surprising result of this Example 2 is the effect the antisense 

fads gene has on the oleic acid content. The predse mechanism by which 
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antisense expression of a gene exerts an effect on the activity of an 
endogenous gene is \inclear but is obviously a function of the homology of 
the sense and antisense gene products. Based upon the above 
experimental result, it would not be unreasonable to predict that the 
5 portion of the fadS gene antisense message used contained a certain degree 
of homology with the genes providing the activity of one or more oleate 
desattirases. Therefore, a further advantage of the above invention is that 
it is possible that expression of a linoleic acid desaturase antisense 
message may exert an effect on oleate desaturase activity. 
10 The unexpected nature of the reduction in oleic add desaturase 

activity from the. antisense fadS plants is most apparent when one 
compares the fatty acid profiles from the antisense plants and the fadS 
mutant otArabidopsis. The levels of linoleic acid in the fadS mutant plants 
increased when linoleic acid desaturase activity was eliminated by 

15 mutation. This indicates that the activity of the oleate desaturase was not 
highly effected by the loss of linoleic acid desaturase activity or by the 
accimiulation of linoleic add. In the fadS mutant of Arabidopsis the level of 
linoleic add increased when the level of linolenic add decreased. However, a 
different pattern occurred in the antisense fad3 plants; In plants which 

20 exhibit a decreased percent of linolenic acid there is no corresponding 
increase, and is often a decrease, in the percent of hnoleic acid. There is an 
increase in the percent of oleate in the antisense fad3 plants. This would 
indicate that oleate desaturase activity is depressed in these plants. The 
effects on the fatty acid profile by the fadS mutation and the fadS antisense 

25 expression are not equivalent, indicating that antisense expression of a 
linoleic acid desaturase can depress an oleate desaturase activity in plants. 
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Modification of linolenic acid levels in soybean 

The isolation of the fadS gene from B. napus provides a tool to 
those with ordinary skill in the art to isolate the corresponding gene or 
5 cDNA from other plant species. There are many examples in which genes 
from one plant species have been used to isolate the homologous genes 
from another plant species. One such plant which could be improved upon 
by the modification of the level of linolenic add is soybean. 

Soybean oil typically contains linolenic add at a level of 7-9% of 

10 the fatty add in the oil. This level is undesirable because it promotes 
instability upon heating and imparts randdity to the finished product. The 
levels of linolenic add can be lowered by the expression of the soybean fadS 
gene or cDNA in an antisense orientation in the developing seed. The 
following example describes one method for the isolation of a fadS cDNA 

15 from soybean. However, similar procedures could be followed to isolate a 
genomic clone which coxold also be used to decrease the level of linoleic acid 
desaturase activity by antisense expression of a portion or all of the gene. 

The fads gene from B. napus is used as a probe to screen a cDNA 
library constructed from soybean mRNA. In order to isolate a cDNA to be 

20 used in decreasing linolenic acid in seed, the optimal tissue to use for the 
isolation of mRNA is developing seed. There is, however, flexibility in the 
choice of methods and vectors which can be used in the construction and 
analysis of cDNA libraries (Sambrook et al, 1989). Procedures for the 
construction of cDNA libraries are available from manufacturers of cloning 

25 materials or from laboratory handbooks such as Sambrook et.al, 1989. 
Once a suitable cDNA library has been constructed from soybean, all or a 
portion of the fad3 cDNA from B, napus is labeled and used as a probe of the 
library. DNA fragments can be labeled for radioactive or non-radioactive 
screening procedures. The library is screened under suitable stringency. 
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Conditions are dependent upon the degree of homology between the fadS 
gene of B. napus and soybean. Probe positive clones are plaque puriiBed by 
standard procedures and characterized by restriction enzyme mapping and 
DNA sequence analysis. Clones are concluded to be soybean fad3 based 
5 upon data obtained from the sequence analysis or by expression in plants. 

The entire clone or a portion thereof is placed down stream of a 
promoter sequence in an antisense orientation. Suitable promoters include 
seed specific promoters, such as the 7S (P-conglycinin) a'-subunit 
promoter, or less tissue specific promoters, such as the CaMV 35S 
10 promoter. An appropriate 3' non-translated region is placed downstream of 
the anidsense cDNA to allow for transcription termination and for the 
addition of polyadenylated nucleotides to the 3'end of the RNA sequence. 
This expression cassette is then combined with a selectable or scorable 
marker gene and soybean cells are transformed by free DNA delivery 
15 (Christou et al, 1990) or an Agrobacterium based method of plant 
transformation (Hinchee et al, 1988). Plants recovered are allowed to set 
seed and mature seed are used for the production of FAMES by the 
procedures outlined above. The FAMES extracts are analyzed by gas 
chromatography to identify plant Hnes with reduced levels of Unolenic acid 
20 in the seed. 

Alternatives to the above methods may include but are not 
limited to the use of degenerate oligonucleotides as probes to screen the 
library. Degenerate oligonucleotide probes would be most optimally 
designed by choosing short segments of the fadS amino add sequence where 
25 the degeneracy of the genetic code is limited or by choosing sequences which 
appear to be highly conserved between the fad3 gene of B. napus and other 
known linoleic acid desaturases, such as the desaturase from the 
cyanobacterium Synechocystis. The oligonucleotides could be labeled and 
used to probe a soybean cDNA library. Alternatively, degenerate 
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oligonucleotides could be used as primers for the isolation of a portion or all 
of the soybean cDNA by PGR amplification. 

Similar procediu*es could be used to isolate the homologous genes 
&om other plant species. Another preferred plant species which could be 
5 improved upon by the modification of the level of linolenic add is flax. Flax 
oil typically contains linolenic add at a level of 45-65% of the fatty add in 
the oil. This level is undesirable because it promotes instability upon 
heating and imparts randdity to the finished product. 

10 Sense expression of fad3 to obtain reduced levels of linolenic add 

- The cloning of the fadS gene also provides a tool to decrease the 
levels of linolenic add via the mechanism of co-suppression. The molecular 
mechanism of co-suppression occurs when plants are transformed with a 
gene that is identical or highly homologous to an allele found in the plants 

15 genome (Bird and Ray, 1991). There are several examples where 
expression of a chimeric gene in plants can result in a reduction of the gene 
product firom both the chimeric gene and the endogenous gene(s). Therefore 
the fads gene product of J3. napus may be reduced by transformation of fi. 
napus with all or a portion of the fadS cDNA which has been isolated. The 

20 resulting plant has reduced linoleic acid desaturase activity in tissues 
where the chimeric gene is expressed. The phenotype of reducing the 
linoleic add desaturase activity is a reduction in the levels of linolenic add. 
The mechanism of co-suppression could be applied to any plant species 
from which the fadS gene is cloned and the plant species is transformed 

25 with fads in a sense orientation. 

In order to reduce levels of Unolenic acid by the mechanism of co- 
suppression, a plant transformation construct is assembled with the fadS 
gene or cDNA in a sense orientation. The entire done or a portion thereof is 
placed downstream of a promoter sequence in a sense orientation. Suitable 
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promoters include seed specific promoters, such as the 7S (P-conglycinin) 
a-subvmit promoter, or less tissue specific promoters, such as the CaMV 
35S promoter. An appropriate 3' non-translated region is placed 
downstream of the fadS gene to allow for transcription termination and for 
5 the addition of polyadenylated nucleotides to the 3' end of the RNA 
sequence. This expression cassette is then combined with a selectable 
marker gene and B. napus cells are transformed by an Agrobacterium 
based method of plant transformation. Plants recovered are allowed to set 
seed and mature seed are used for the production of FAMES which are 
10 analyzed by gas chromatography to identify plant lines with reduced levels 
of linolenic add in the seed. 
Example 5 

Isolation of a chloroplast delta 15 d esaturase from Arahidopsbs 

A fragment of 959bp was excised from the fad3 cDNA insert 

15 using the restriction endonuclease BgUI, and labeled radioactively according 
to Feinberg and Vogelstein (1983). This fragment was used to probe a 
cDNA library from Arabidopsis thaliana as described above (Exeunple 1) 
except that the hybridization temperature was 52° C. Several cDNA 
clones were positive, and one of them (pVAl) was further characterized. 

20 Its deduced amino acid sequence exhibited a strong homology with fad3 
except at the N-terminus. The cDNA insert was placed xmder the control of 
the 35S promoter in the Ti vector pBI121, and the resulting construct, 
pBIVA12 was electroporated into Agrobacterium (058 pGVSlOl). The 
bacterium was used to transform the Arabidopsis mutant fadD. For 

25 transformation, plants were grown at 22° C with a light intensity of 
lOO/M.E/cm-2, until bolting (approximately 2 and 1/2 weeks). The stems 
(Imm-lOmm long) were removed and the plants were inoculated with a 
drop of an overnight culture of the bacterivim. The same operation was 
repeated 7 days afterwards. 
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The plants were then allowed to set seeds. The seeds were 
plated (2500 seeds per ISOmm petri dish) on MSO plates that contained 
50^g/inl kanamycin to select for plants that had integrated the construct. 
One transformant plant was obtained, and the fatty acids from its leaves 
were analyzed by gas chromatography (Table 4). The results obtained 
show that the pBIVA12 construct is able to reestablish the levels of 
linolenic and hexadecatrienoic adds in the fadD mutant at a level equal to 
or superior to the wild tjrpe. This demonstrates that pVA12 encodes the 
fadD gene. 

TABLE 4 " 





- fatty acid l- 


fedD 


V WT 


FadD 










pBIVA12 


15 












: ^ -16:0 - ; • ' 


13.0 


14.0 


14.9 




16:1 


4.9 


4.3 


4.2 




16:2. , 


8.7 


0.5 


0.3 




16:3 


3.0 


13.2 


9.5 


20 


18:1 


3.3 


2.3 


1.2 




18:2 


36.4 


10.9 


5.8 




18:3 


30.8 


54.6 


63.7 



Table 4 shows the complementation of the fadD mutant. 
25 Fatty acids were extracted from leaves of Arabidopsis according to Browse 
et al (1986) and were quantified (mol%) by gas chromatography. WT 
stands for the Columbia wild type. 
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Example 6 

Isolation of a second ch loroplast delta 15 desaturase from Arabidopsis 

A fragment of 959 bp was excised from the cDNA insert using 
the restriction endonuclease Bglll, and labelled radioactively according to 
5 Feinberg and Vogelstein (1983). This fragment was used to probe a cDNA 
library from Arabidopsis, exactly as described above (Example 5). Among 
the several positive clones obtained, the cDNA pVA34 was further 
characterized. Its deduced amino acid sequence exhibited 71.8% and 79.5% 
homology with fadS and fadD, respectively. The N-terminus resembled a 

10 chloroplast transit peptide, meaning that this protein is likely to be 
localized to the chloroplast. The strong homology with fadS and fadD 
suggests that the protein is also a delta 15 desatxarase. Aside from fadS 
and fadD, the only locus known to control delta 15 desaturation is the fadE - 
locus, which controls a temperature-induced delta 15 desaturase. 

15 Therefore, it is likely that the cDNA contained within the clone pVA34 
corresponds to the fadE loois. 

Exfiinpk 7 

Linoleic desaturase homology to pla nt oleic desaturases 

The linoleic desaturase genes are the first plant desaturases 

20 isolated whose proteins enz3niiatically perform the desaturation of an 
unsaturated fatty acid precursor. The reaction that linoleic desaturase 
performs and the cofactors it uses are likely to be very similar for the oleic 
desaturase reaction. Given the similar reactions, similar substrates and 
probably similar cofactors, it is likely that the oleic desatiu-ase genes and 

25 proteins have homology to the linoleic desaturase genes and proteins. That 
the genes share homology is supported by the finding that antisense 
expression of the linoleic acid desaturase message results in higher oleic 
acids levels, which experimentally indicates homology between the linoleic 
and oleic desaturases. These factors indicate that the linoleic desatiirase 
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protein and nucleic acid sequences provide useful information for isolating 
other lipid desaturase genes, particularly oleic desaturase genes. 

a. Identification of unknown cDNA seque nces in databases, 
5 Random cDNA sequencing generates a large number of 

sequenced clones but provides no information about the function of the 
encoded proteins. Homology to known proteins is the quickest method for 
identifying the protein function encoded in the sequenced cDNA. However, 
homology searches are informative only when a homology with a previously 

10 characterized protein are found. A cDNA sequence that is not homologous 
to any known protein remains in the \mkn6wn function category. Thus the 
results functionally identifying the linoleic desaturases by sequence and by 
their ability to coniplement mutatioiis in plant desaturase genes ' now 
provides a method for identifying the function and identity of random cDNA 

15 clones by their Iromology to the linoleic desaturases^ Additioiiaily oleic 
desaturases are identified by their homology with linoleic desaturases. 

A TFASTA search of the GenBank and EMBL public data 
bases for genes encoding proteins homologous to the protein sequence of the 
linoleic desaturase fadS has identified both linoleic desaturases and a 

20 second class of plant lipid desaturases likely to be oleic desaturases. In 
particular, sequences found in GenBank and EMBL and identified as 
T04093 and T12950 show significant homology to linoleic desaturases but 
show less homology than other linoleic desaturases. These sequences have 
30% homology to fadS and 56% similarity to fadS linoleic desaturase 

25 (TABLE 5). The full length clone of these cDNAs is obtained by standard 
methods and is inserted into plant gene expression and transformation 
vectors and transformed into fad2 Arabidopsis mutants to confirm the 
identity of the oleic desatiu-ase by genetic complemention as was described 
in the example with linoleic desaturase. 
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TABLES 

Comparison of Pad3 and T04093 Protein Sequences 

Percent Similarity: 52.381% Percent Identity: 30.476% 



fad3 101 GHGSFSDIPLLNSWGHILHSFILVPYHGWRISHRTHHONHGHVENDESW 150 
T04093 1 1:111:1111 :|:.||| || | | , 

"^"^^ ^ "FHSFLLVPYFSWKySHRRHHS^P^GSLERDEVF 34 

151 yPLPEKLYKi;,LP HSTRMLRYTVPLPMLAYPIYLWRSPGKEGSHF 195 

35 VPKQKSAlKWYGKYUONPLGRIMtaWQF.Vl^WPLYIAFlivSGR. . ,py 80 



30 



196 



NPYSSLFAPSERKLlATSTTCWSlMLATL^LSFLVDPV^VLKVYGVPYi 245 



II . 

20 - 81 DGFACHFFPNAPIYNDRERSRYTSLMRVF 



110 



b. Isolation of a oleic desaturase cDNA. 
2^ The protein sequence of plant linoleic desaturases can be used 

to isolate oleic desaturases. The conserved regions between the linoleic 
desaturases and the DesA oleic desaturase are functionally important and 
are conserved in the plant oleic desaturase proteins as well. These 
conserved amino acid sequences provide a method of isolating plant oleic 
desaturases. There are several regions of the linoleic desaturase fadS that 
are conserved in fadD, fadE and DesA. The consensus amino acid sequence 
is shown in Table 6, with the amino acids identical in all four proteins shown 
in capital letters. As described below, oligonucleotides designed to encode 
the amino acids sequences in the conserved regions are used to identify and 
35 isolate plant oleic desaturases. 
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TABLE 6 

FadB Protein Sequence and Peptide Targets 

MWAMDQRSNVNGDSGARKEEGFDPS AOPPFKIGDIRAAI PKHCWVKS PLRSMS YVTRD 
V- tplttp . . . spseed. . erf dpgapppf . laDIraaiPKhCVvKnpwksmsyVvxd 

DIraaiPKhCvTvK 
(la) DIraaiP 

(lb) aiPKhC 

(Ic) KhCwvK 



IFAVAALAMAAVYFDSWFLWPLYWVAQGTLFWAIFVLGHDCGHGSFSDIPLLNSWGHIL 
va - vf alaa . aay f nnW . IwPlyW . aqGTmf walFVlGHDCGHgSFsndp . INswGH . 1 
wflwPlvWvaaGT FVtGHPCGHqSF 
(2a) WflwPlyW (3a) FVlGHD 

15"" (2b) WflwP"""- (3b) VIGHDC 

. (2c).wPlyW (3c) GHDCGH 

(2d) WvaqGT (.3d) CGHgSF 

• HSF^LVPYHGWRISHRTHHQNHGHVE^^DESWPLPEKLYPJLPHSTRMLRYTVPLPMLAY^ 
20 .hssilvPyHgWRisHrtHHgnhghvEnDesWhPl ..e)ciy)cnlpk. trmf rf tlplpmlay 
PvHQWRisHrtiHH ^ - EUPeSWyp- " - - 

- - (4a) PyHgW (5a) EnDesW -- ■ - , 

(4b) HgWRisH (5b) DesWvP 

(4c) WRisHrtHH - . 

25 (4d) WRisH 

(4e) HrtHK 

PIYLWYRSPGKEGSHFNPYSSLFAPSERKLIATSTTCWSIMLAT . L\'YLSFLVDP\n:*VLK 
pfylw. rspgJc.gShyhDds . IF .okerkdvltScacwramaAl , IvcLnf t .gpiqmlK 

30 * 

VYGVPYIIFVMWLDA\'TyLKHHGHDEKLPWYRGKEWSYLRGGL . TTIDRDYG . IFNNIH 
lygiPywifvinWldfvTylHHhghedkipwyrgkeWSylrggL.tTldrDYg.winnih 
WldavTvlHH WSv J,?rqq); . t;T^gSr,l3.y 

(6a) WldavT (7a) WSylrggL 

35 (6b) TylKK (7b) L tTidrD 

(7c) TidrDY 
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HDIGTHVIHHLFPOIPHYHLVDATRAAKHVLGRYYREPKTSGAIPIHLVESLVASIK 

HDIgtHviHHLfpqIPhYhLveAteaaKpvlGkyyrEpk.sgplplhLlesl.ksik 

HDIatHviHHLfpgTPhY 

5 (8a) HDIgtH 

(8b) HviHHL 

(8c) HHLfpgl 
(8d) HLfpqIP 
(8e) LfpqlPhY 

10 

KDHYVSDTGDIVFYETDPDLYVYASDKSKIN* 
- dhyvsdt Gdwy Yeadp . lyg . . s * 

15 c. Isolation of the fadC ffadS) Gene fr om ArahiiljnDsis thaliann 

The fade gene (also referred to as fad6) encodes a 
chloroplastic omega-6 desaturase. 

The deduced amino acid sequences of the fadS gene from 
Brassica napus and the fadD and fadE genes from Arabidopsis thaliana 
20 were compared with the DesA gene from Synechocystis (Nature, 347:200, 
1990). The sequence GHDCGH was determined to represent the most 
highly conserved region of these proteins. Consequently, a degenerate 
oligomer was designed that contains all the possible condons for the 
sequence GHDCGH. This oligomer has the following sequence: 
25 GGNCAYGAYTGYGGNCA. 

An Arabidopsis thaliana cDNA phage library obtained from 
the laboratory of Dr. Ron Davis (PNAS, 88: 1731-1735) was used to screen 
for desaturase genes. This library was made using material from all above 
groxmd plant parts. 

30 Approximately 120,000 phage from the library were plated 

onto three plates and hybondN+ was then used to prepare three filters from 
esich plate (Molecular Cloning ' A Laboratory Manual , 2nd Edition. Eds. J. 
Sambrook, E. F. Fritsch, and T. Maniatis, Cold Spring Harbor Laboratory 
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Press, Cold Spring Harbor, New York 1989, hereafter *'Sambrook*'). Two 
filters from each plate were probed using the degenerate consensus 
oligomer which had been end-labelled with (32)P using T4 pol3niucleptide 
kinase. The hybridizations were performed in a solution that contained high 
5 amounts of tetramethylammonium chloride in order to minimize differences 
in the melting temperatures of the oligomers that together comprise the 
degenerate consensus oligomer. The hybridization solution had the 
following composition: 3 M tetramethylammonium chloride, 10 mM sodiimi 
phosphate pH 6.8, 1.25 mM EDTA, 0.5% SDS, 0.5% milk. Hybridization 

10 was carried out overnight at a temperature of 44^C. Filters were then 
washed four times, 20 minutes each time, with 6 x SSC + 0.15% SDS at" 
room temperature. Filters were then washed one time, for 30 minutes, with 
4 X SSC + 0.1% SDS at room temperature. The filters were then exposed to 
film for two days: - ^ - : " - ^^-^ : - 

15 - - The third set of filters that were made from each phage- 

containing plate were probed using DNA sequences from the three 
Arabidopsis desaturase genes that had already been identified: fadS, fadD 
and fadE. The fadS, fadD and fadE genes were labelled with (32)P and 
hybridized to the third set of phage filters in the following hybridization 

20 solution: 0.2 M NaCl, 20mM sodiimi phosphate pH 7.7, 2mM EDTA, 1% 
SDS, 0.5% milk, 10% dextran sulfate, 0.1% sodium pyrophosphate. 
Hybridization was carried out overnight at 65^C. Filters were washed four 
times, 30 minutes per time, in 2 x SSC + 0.15% SD at room temperature 
and then for 45 minutes with 1 x SSC + 0.1% SDS at 65^ C. The filters 

25 were then exposed to film for approximately two hours. 

The two sets of filters that were probed with the degenerate 
consensus oligomer showed about 60 positive phage per plate (or about 180 
total positive phage). Results from the third set of filters that were probed 
with the fad3, fadD and fadE genes indicated that only a small percentage 
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of the phage that hybridized to the consensus of oligomer contained the 
fads, fadD or fedE genes. 

Seventy-six of the phage that hybridized to the consensus 
ohgomer, but not to the fadS, fadD or fadE genes, were plaque purified. The 
5 purified phage were then spotted onto bacteria growing on sohd media on 
plates and allowed to form plaques. Several duplicate filters were then 
made of these plates (Sambrook). One of these filters was probed with the 
consensus oligomer, as described above. A second filter was probed with a 
mixture of the Arabidopsis thaliana fad3, fadD and fadE genes, as 
10 described above. 

In order to determine which of the 76 phage contained the 
same cDNA inserts as which other phage, some of the filters were probed 
with cDNA inserts from some of the phage. In order to perform this 
experiment, the cDNA inserts fi-om most of the phage were isolated by 

15 using oligomers that bound to DNA flanking the cDNA cloning site in the 
phage vector to isolate the cDNA sequences using the polymerase chain 
reaction (PGR). These cDNA sequences were labelled with (32)P (random 
hexamer labelling) and hybridized to the filters using the following 
hybridization solution: 30% formamide, 0.2M NaCl, 20mM sodium 

20 phosphate pH 7.7, 2mM EDTA, 1% SDS. 0.5% milk. 0.1% sodium 
pyrophosphate. The hybridizations were carried out for 14 hours at 65«C. 
The filters were washed foiir times 15 minutes per wash, with 2 x BSC + 
0.15% SDS at room temperature and were then exposed to film. 

The combination of the high formamide concentration in the 
25 hybridization solution and the high hybridization temperature meant that 
only DNA sequences that were virtually identical wotdd hybridize, allowing 
us to distinguish between nearly identical sequences. Several rounds of 
hybridizations using cDNA inserts from different phage were carried out 
imtil it had been determined which phage contained the same, or at least 
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extremely similar, cDNA inserts. On the basis of these experiments, we 
determined that all of the 76 phage contained one of four cDNA inserts. 
Sequence data was obtained from each of these four cDNAs. None of these 
cDNAs was foimd to be homologous to known desaturase genes, and so we 
5 feel that none of these four cDNAs is likely to encode a desaturase. 

Since the number of phage that hybridized to the consensus 
oligomer was quite high (about 180 phage hybridized in the initial screen 
described above), we were not able to analyze all of the positive phage in 
the initial experiments. So, an attempt was made to identify phage that 

10 hybridized to the consensus oUgomer but that did not contain the fadS, fadD 
of fadE genes or one of the foiur non-desaturase encoding clones that were 
identified in the first screen. In order to do this, between 500,000 and 
1,000,000 phage from the library described above were plated onto 10 
plates. "Three filters were made from each plate (Sambrook). Two of these 

15 three, sets oif filters were then hybridized with (32) P labelled consensus 
oligomer as described above except that hybridization was carried out at 
42^C instead of at 44-C. The third set of filters were hybridized with (32)P 
labelled DNA from the Arabidopsis fadS, fadD and fadE genes together with 
DNA from each of the four cDNA's identified in the first roimd of screening 

20 as hybridizing to the consensus oligomer but not encoding desaturases. 
This third set of filters were hybridized in: 30% formamide, 0.2 M NaCl, 
20mM sodium phosphate pH 7.7, 2mM EDTA, 1% SDA, 0.5% milk, 0.1% 
sodiimi pjnrophosphate at 65®C. All three sets of filters were hybridized for 
12 hours and then washed several times with 2 x SSC + 0.15% SDS at 

25 room temperature. The filters were then exposed to film. 

Approximately 200 phage from each plate hybridized to the 
consensus oligomer. 50-60% of these phage also hybridized to fadS, fadD, 
fadE or to one of the four clones identified in the first screen. About 58 
phage that hybridized to the consensus oligomer, but not to fadS, fadD, 
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fadE or one of the four previously identified clones, were plaque purified. 
The purified phage were then spotted onto a bacterial lawn growing oh solid 
media on a petri plate and the phage were allowed to form plaques. Several 
filters were prepared fi-om these plates and hybridized with (32)P labelled 
5 cDNA inserts fi^om various of the newly purified phage, as described above. 
In this manner, all of the phage identified in this second round of screening 
were foimd to contain one of eight different cDNA inserts. 

Sequence data was obtained fi-om each of the eight cDNA's. 
One of the cDNA's, which was contained within only one of the phage, was 
10 found to have some sequence similarity of a known desaturase gene fi-om 
cyanobacteria, the DesA gene. Further sequence information was obtained 
for this clone. This additional sequence showed very significant sequence 
similarity to the DesA gene, confirming that the clone contained a 
desaturase gene. The remainder of the cDNA contained within the clone 
15 was sequenced and compared with the sequences of other known 
desaturases. The new desaturase was 53.0% identical to DesA at the 
nucleotide level and 43.9%, 45.6% and 47.0% identical to S. napus fad3, 
Arabidopsis fadD and Arabidopsis fadE, respectively. As the gene 
contained within the clone was significantly more similar in sequence to the 
20 DesA gene (which is a delta-12 desaturase) than to fad3, fadD or fadE 
(which are omega-3 desaturases), the new desaturase was expected to be a 
delta-12 (= omega-6) desaturase. 

The additional sequence data also indicated that this new 
desaturase gene contains a region that has only a one base pair mismatch 
25 to the desaturase consensus sequence described above. This mismatch 
means that the new desaturase has the sequence GHDCAH instead of 
GHDCGH. 

A clone containing a full length cDNA for this gene was 
isolated and completely sequenced. This full length cDNA was sub-cloned 
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into the plant transformation vector pBII121 such that the gene is 
transcribed under the control of the 35S promoter. This construct was 
then used to complement the phenotype of a fadC mutant (Plant Phys. 90: 
522-529^ 1989) of Arabidopsis thaliana , indicating that the gene encodes a 
5 chloroplastic omega-6 desaturase. 

d Proposed isolation of fad2 

The most highly conserved peptide regions in the linoleic 
desaturases and the DesA desaturase were chosen as regions likely to be 
conserved in oleic desaturases. These 8 conserved regions are shown in 

10 TABLE 6. These regions were chosen on the following basis: These regions 
have areas highly conserved between the 3 linoleic desaturases and DesA, 
with at least 4 identical amino acids over a 10 amino acid span. Once a~ 
region was identified as conserved, the fad3 iinoleic desattu?ase sequence 

: : was used as the amina add sequence for the soim:e of homology to identify - 

15 oleic desaturases. This is because, both fadS and the non-plastid oleic 
desaturases are thought to.be localized to the endoplasmic reticuliun and 
are most likely to contain similar amino add sequences. 

Several peptide endpoints in each conserved area were chosen 
as the basis to subsequently design oligonucleotide probes for identifying 

20 the oleic desaturase gene. . The peptide endpoints were chosen to be 
between 5 and 9 amino acids in length. The peptide end points were chosen 
to end on the conserved (identical) amino adds, and most often to begin on 
conserved amino acids. The rationale is that within the larger conserved 
area, some amino add portions are more highly conserved than others, that 

25 15 to 27 (5 to 9 amino adds) nucleotides is a good primer size for PGR, and 
that for PGR it is important that the 3' end of the primer matches the 
target, with the conserved (identical) amino acids the most likely to be 
present in the oleic desaturases. These 28 "oleic desaturase** peptide 
targets (Table 6) are the basis oligonucleotides that are designed for 
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hybridizing to the oleic desatxirase cDNA sequences to identify and isolate 
the oleic desaturase cDNA clone. 

Several possible methods for designing oligonucleotides and 
isolating the genes encoding the target peptide regions are kno^. For a 
5 discussion of designing degenerate oligonucleotides see PCR Protocols - A 
Guide to Methods and Applications, Eds M. A. Innis, D. H. Gelfand, J J 
Sninsky and T. J. White, Academic Press, San Diego, California, 1990; and 
Sanxzxxx The two most common Screening methods using the 
oUgonucleotides are screening cDNA Ubraries and PCR amplification of 
10 specific cDNAs. Gene probes from fad3, fadD and fadE are used under 
stringent hybridization conditions to identify these cDNAs and discard 
them in the screen for oleic desaturase cDNA clones. The method for using 
degenerate oUgonucleotides to screen a cDNA libraiy has been described in 
the example above demonstrating the isolation of the fedC oleic desaturase 
gene. An immature plant seed active in oil biosynthesis, generally 2 to 5 
weeks after pollination, preferably about 3 to 4 weeks after poUination, of a 
plant such as Arabidopsis or canola is used as the source of mRNA for 
making cDNA. First strand cDNA is made fi-om the isolated mRNA and 
hybridized under stringent conditions in solution to an excess of biotinylated 
20 fadS, fadD and fadE cloned trDNAs. The hybrids and biotinylated nucleic 
adds are removed with strepavidin and a second round of substraction is 
done to remove any remaining fad3, fadD and fadE sequences. The cDNA 
remaining in solution is used for PCR reactions. (For 5' RACE, see below, a 
polyA tail is added to the first strand cDNA 3' end). 

A method that can readily evaluate a number of degenerate 
oligonucleotides probes is degenerate PCR (See chapters by Compton and 
by Lee and Caskey in PCR Protocols, cited above). In this method a 
degenerate set of oligonucleotides encompassing all the possible codon 
choices for the target peptide is synthesized (such degenerate 



15 



25 
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targets (Table 6) are the basis oligonucleotides that are designed for 
hybridizing to the oleic desaturase cDNA sequences to identify and isolate 
the oleic desaturase cDNA done. 

Several possible methods for designing oligonucleotides and 
5 isolating the genes encoding the target peptide regions are known. For a 
discussion of designing degenerate oligonucleotides see PCR Protocols - A 
Guide to Methods and Applications, Eds M. A. Innis, D. H. Gelfand, J J 
Snins^ and T. J. White, Academic Press, San Diego, California, 1990; and 
Sambrook. The two most common screening methods using the 
10 oligonucleotides are screening cDNA libraries and PCR amplification of 
..specific cDNAs.- Gene probes £rom fadS, fadD and fadE are used imder 
stringent hybridization conditions to identify these cDNAs and discard 
7-them m the screen for oleic desaturase cDNA clones. The method for using : 
degenerate, oligonucleotides .to screen a cDNA library has-been described ih^ 
15 the; example above demonstrating the isolation of the fadC oleic desatiirase 
gene. An immatture plant seed active in oil biosynthesis, generally 1 to 5 - 
weeks after pollination, preferably about 2 to 4 weeks after pollination, of a 
plant such as Arabidopsis or canola is used as the source of mRNA for 
- making cDNA. First strand cDNA is made firom the isolated mRNA and 
20 hybridized under stringent conditions in solution to an excess of biotinylated 
fads, fadD and fadE cloned cDNAs. The hybrids and biotinylated nucleic 
acids are removed with strepavidin and a second round of substraction is 
done to remove any remaining fadS, fadD and fadE sequences. The cDNA 
remaining in solution is used for PGR reactions. (For 5' RACE, see below, a 
25 polyA tail is added to the first strand cDNA 3' end). 

A method that can readily evaluate a nxmiber of degenerate 
oligonucleotides probes is degenerate PCR (See chapters by Compton and 
by Lee and Caskey in PCR Protocols^ cited above). In this method a 
degenerate set of oligonucleotides encompassing all the possible codon 
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TABLE7 

Peptide Targets for Pad2 Cloning 



Peptide sequence 





±8L 


DIRAAIP 




XD 


AIPKHC 






ixHCWVK 






TaTT?T TATtiT VTa7 


10 




WFLWP 






WPLYW 






WVAQGT 






FVLGHD 






VLGHDC 


XD 




GHDCGH 






CGHGSF 






PYHGW 






HGWRISH 




4c-l 


WRISHRTHH 


20 


4c-2 






4d 


WRISH 




4e 


HRTHH 




5a 


ENDESW 




5b 


DESWVP 


25 








6a 


WLDAVT 




6b 


TYLHH 




7a-l 


WSYLRGGL 




7a-2 




30 


7b 


LTTIDRD 




7c 


TIDRDY 




8a 


HDIGTH 




8b 


HVIHHL 


35 


8c 


HHLFPQI 


8d 


HLFPQIP 




8e 


LFPQIPHY 



Oligo sequence 5 ' - 3 ' 

GAYATHMGNGCNGCNATHCC 

GCNATHCCNAARCAYTG 

AARCAYTGYTGGGTNAA 

TGGTTYYTNTGGCCNYTNTAYTGG 

TGGTTYYTNTGGCCN 

TGGCCNYTNTAYTGG 

TGGGTNGCNCARGGNAC 

TTYGTNYTNGGNCAYGA 

GTNYTNGGNCAYGAYTG 

GGNCAYGAYTGYGGNCA 

TGYGGNCAYGGNWSNTT 

CCNTAYCAYGGNTGG 

CAYGGNTGGMGNATHWSNCA 

TGGMGNATHTCNCAYMGNACNCAYCA* 

TGGMGNATHAG YCAYMGNACNCAYCA * 

TGGMGNATHWSNCAY 

CAYMGNACNCAYCAY 

GARAAYGAYGARWSNTGG 

GAYGARWSNTGGGTNCC 

NGTNACNGCRTCNARCCA 

RTGRTGNARRTANGT 

ARNCCNCCNCKNARRTARCTCCA * 

ARNCCNCCNCKNARRTANGACCA * 

RTCNCKRTCDATNGTNGTNA 

RTARTCNCKRTCDATNGT 

RTGNGTNCCDATRTCRTG 

NARRTGRTGDATNACRTG 

DATYTGNGGRAANARRTGRTG 

GGDATYTGNGGRAANARRTG 

RTARTGNGGDATYTGNGGRAANA 



* synthesize 4c and 7a in two pools each to limit the 
40 degeneracy 

Oligos for 6a - 8e are the complement of the coding 
sequence 
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TABLE 8 

Table of OUgomers for PGR RACE of £Bd2 





jrepuQe tr 




P UlU 


OimilaX 1 Itjr 


iJliillirli i.l»jf Ul 










with L.26296 


Last 10 n t 




la 


OA 






fin 05. 




ID 


17 




fifi 
OO 


fin 




ic 


17 


QO 


DO 


fin 


10 














2a 


24 


64 


79 


100 






15 




4 O 


fin 




2c 


15 


48 


100 


100 




2d 


17 


128 


76 


90 
















3a . . 




. . ..,r:384 - ^. 


^. - 76 


7.0... 




3b 


■ ,17 : 


384 


82 


80 -. 




3c 


' 17 


128 


88 


• 90 




3d... r^' 




r 384 


-.-:8.2- . • r.. "'"1 


70 


20 














- -- '4a 




-""^^64 


QQ.-:-r:-..i 


7a^^'" 




- - -.^b - 


:: 2^ J • 


^192 " 


75. - ■■■ 


90- 




4c 


26 


96* 


81 


80 




■ ■id " 


15 


216 


87 


90 


25 


4e. 


15 


192 


• . 87 


oO 




5a 


18 


96 


72 


80 




5 b _ 


17 


- . 96. 


. 76. 


80 


30 


6a 


18 


256 


78 


80 




6b 


15 


192 


93 


100 




7a 


23 


256* 


78 


60 




7b 


20 


384 


90 


80 


35 


7c 


18 


192 


94 


90 




8a 


18 


384 


72 


70 




8b 


18 


192 


89 


80 




8c 


21 


384 


81 


100 


40 


8d 


20 


192 


80 


90 




8e 


23 


192 


83 


70 



done in two oligo pools 
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Table 7 shows the 28 peptide targets from the eight conserved 
regions and the 30 degenerate oligonucleotides derived from the peptide 
sequences. The degeneracy was kept to less than 516 fold, for those 
instances where more degeneracy occurred, by the use of deox3dnosine 
5 (Sambrook et al.) and by not including the last nucleotide in the last codon, 
and in two cases by the use of two subpools. Table 8 shows the amotmt of 
degeneracy for each designed oligonucleotide sequence and the amoxmt of 
homology of the oligonucleotides to the Arabidopsis oleic desaturase fad2 
(Accession No. L26296). Also shown in Table 8 is the percent homology in 

10 the last 10 nucleotides on the 3' end of each primer, since this region is most 
important for annealing and elongation under PGR conditions. It is 
expected that both 10 of 10 and 9 of 10 homology matches, and probably 8 
of 10 homology matches in the 3' primer regions will serve as eflBcient PGR 
primers. Note that for oligonucleotide sets la through 5b (for 3* RAGE) the 

15 strand direction is the same as the mRNA while for oligonucleotide sets 6a 
through Be (for 5' RACE) the direction is opposite of the mRNA. Four 
oligonucleotides have a 10 of 10 match in the 3' position, 6 oligonucleotides 
match 9 of 10 in the 3' position and 12 match in 8 of 10 nucleotides in the 3' 
position. Ohgonucleotides corresponding to peptides 2a, 2c, 2d, 3c, 4b, 4d, 

20 6b, 7c, 8c, and 8d show 90% or greater homology in their last 10 
nucleotides and anneal to the oleic desatiu*ase gene and serve as primers to 
this gene. This demonstrates the vaUdity of using the conserved regions of 
the plant linoleic desaturases and DesA to identify and isolate plant oleic 
desaturases. 

25 The first round of PGR products are subjected to two roimds of 

subtraction using biotinylated fadS, fadD and fadE cloned cDNA to remove 
any hybridizing fadS, fadD and fadE sequences with strepavidin. This 
subtracted DNA is greatly enriched for fad2 sequences and depleted of fad3, 
fadD and fadE sequences. These 30 samples are run on agarose gels, 
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blotted and hybridized with pools of probe from the 30 samples. Pools of 5 
of each of the 30 PGR samples are labeled with random primers and 
hybridized to the blots of the 30 samples, for a total of 6 blots hybridized 
with 6 pools of 5 probes. Additionally, a pool of fad3, fadD and fadE probe is 
5 hybridized to a duplicate blot. Bands that do not hybridize strongly to fad3, 
£adD and fadE but do cross hybridize to probe made from a different sample 
are strong candidates for fad2 as fad2 is likely to be the only DNA amplified 
in two or more independent PGR reactions. Positively hybridizing lanes 
identify samples to amplify by PGR using the same primers as in the initial 

10 reaction for 5 tolO cyclies and the PGR products are cloned into plasmid 
vectors. The'saine probe that recognized the sample on the blot is used^to 
screen the library and identify the hybridizing clone. Positive clones are 
" sequenced and identified as fad2 clones by their homology but non-identity 
with fad3, and fiirther chsuTacterized as described below. 

15 In the event that fad2 sequences are hot sufficiehtly enriched 

in one round of PGR to be identified, a. second round of PGR is performed. If 
the lack of detection is due to insufficient amplification of fad2, then 
another round of PGR using the same primers on the subtracted PGR first 
round samples and the same simple screen as described above will identify 

20 fad2. If there are too many competing non-specific reactions then a second 
round of PGR using a different primer combination will remove non-specific 
amplifications and enrich for fad2. To further enrich for fad2 sequences 
each of the initial 30 PGR samples (one for each oligonucleotide in Table 7) 
after subtraction as described above, is subjected to a second roxmd of PGR 

25 reactions using a different primer combination than the first reaction. One 
of the primers woxild be the same degenerate oligonucleotide primer as in 
the first PGR reaction. The second primer would now be from one of the 30 
primers in Table 7 from the opposite class, ie, primers from la to 5b form 
matched sets with primers from 6a to 8e (primers la to 5b are in the sense 
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direction while primers 6a to 8e are in the antisense direction). For 
example, if oligonucleotide la was used initially, it is used again as one of 
the two primers and the second primer is each of the 6a to 8e 
oligonucleotides for a total of 11 separate PGR reactions. In total the 30 
5 initial reactions result in 418 second cycle PGR reactions, a number easily 
handled by PGR technology. Essentially this second PGR cycle 
accomplishes a ^^nested" or sequential PGR reaction step after removing all 
the linoleic desaturases by the subtraction step. This increases the 
amplification as well as the specificity. Identification of samples containing 
10 fad2 are performed similarly as described above, with the 418 samples dot 
blotted onto 22 filters and probed with 21 pools of 20 samples and with a 
pool of fad3, fadD and fedE. Again, any sample that cross hybridizes with 
an independent probe sample and does not hybridize to fad3, fadD and fadE 
is a candidate for containing fad2 in the sample. If fadS, fadD and fadE 
15 hybridization is still present, another biotinylation/stepavidin subtraction 
should remove it. Positively hybridizing samples are run on gels, the band 
identified by hybridization and isolated for cloning. This second set of PGR 
reactions produces PGR products of a predictable size since both primers 
are within the coding region where little variation in size is expected. Thus 
20 the presence of a band of the expected size on a gel is diagnostic of fad2, 
particularly if hybridization of a blot of such a gel with a fad3, fadD and 
fadE probe indicates the band is not due to fad3, fadD and fadE 
contamination. After cloning the inserts in E. coli, the resulting plasmids 
containing the insert are identified by hybridization. They are sequenced 
25 and identified as oleic desaturases by their homology but non-identity with 
the linoleic desaturases, as in the examples described previously. The ftdl 
length clone of these cDNAs is obtained by standard methods and inserted 
into plant gene expression and transformation vectors and transformed 
into Arabidopsis fad2 mutants to confirm the identity of the oleic 
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desaturase by genetic complemention as was described in the example with 
linoleic desaturase. 

Thus in this approach to isolating the plant oleic desaturases, 
the total number of peptide regions is 8, comprised of 28 smaller peptide 
5 targets. This leads to set of 30 degenerate oUgonudeotides, that are used in 
the PGR amplification and screening of the PGR products. Subtraction of 
interferinjg fad3, fadD and fadE sequences is used at several points. If 
necessary a second round of PGR reactions with paired internal primers 
gives extra amplification and specificity. This approach identifies the plant 
10 oleic desatxirases, and the sequence of the isolated clones should confirm 
their identity by their homology to the plant linoleic desaturases as 
described. Thus a defined approach to isolating the plant oleic desaturases 

~. ^ - -firom the information about linoleic desaturases is presented here. The 
example; given here is for Arabidopsis-or canola oleic desaturases, but the 

: . . 15 approach-is not limited to those plants as the oleic desaturases" are 
probably highly conserved in most plants. Thus once one plant oleic 
desaturase is isolated, the sequence information is used to isolate the genes 
fi*om other plant species by direct hybridization or by an approach similar 
to. the one described here. 

20 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Monean^o Company 

(B) STREET: 800 North Lindbergh Boulevard 

(C) CITY: St. Louie 

(D) STATE: Mia sour i 

(E) COUNTRY: United States of America . 
(P) POSTAL CODE (ZIP): 63167 

(G) TELEPHONE: (314)694-3131 

(H) TELEFAX: (314)694-5435 

(ii) TITLE OF INVENTION: Altered Linolenic and Linoleic Acid Content 
in Plants 

(iii) NUMBER OF SEQUENCES: 72 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
- (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 

(vi) PRIOR APPLICATION DATA: _ . . 

(A) APPLICATION NUMBER: US 08/156551 

(B) FILING DATE: 22-NOV-1993 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/014431 

(B) FILING DATE: 05-FEB-1993 

(2) INFORMATION FOR SEQ ID NO:l: - - - . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 87. .1238. 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AATCCATCAA ACCTTTATTC ACCACATTTC ACTGAAAGGC CACACATCTA GAGAGAGAAA 60 
CTTCGTCCAA ATCTCTCTCT CCAGCG ATG GTT GTT GCT ATG GAC CAG CGC AGC 113 
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Met Val Val AXa Met: Aep Gin Arg Ser 
I 5 

AAT GTT AAC GGA GAT TCC GGT GCC CGG AAG GAA GAA GGG TTT GAT CCA 161 
Asn Val Aen Gly Asp Ser Gly Ala Arg Lys Glu Glu Gly Phe Asp Pro 
10 15 20 25 



AGC 6CA CAA CCA COG TTT AAG ATC GGA GAT ATA AGG GCG GCG ATT CCT 
Ser Ala Gin Pro Pro Phe Lye lie Gly Aep lie Arg Ala Ala He Pro 
30 35 40 

AAG CAT TGC TGG GTG AAG AGT CCT TTG A6A TCT ATG AGC TAC GTC ACC 
Lys His Cys Trp Val Lys Ser Pro Leu Arg Ser Met Ser Tyr Val Thr 
45 50 55 

AGA GAC ATT TTC GCC GTC GCG 6CT CTG GCC ATG GCC GCC GTG TAT TTT 
Arg Asp He Phe Ala Val Ala Ala Leu Ala Met Ala Ala Val Tyr Phe 
60 65 70 

GAT AGC TGG TTC CTC TGG CCA CTC TAC TGG GTT GCC CAA GGA ACC CTT 
Asp Ser Trp Phe Leu Trp Pro Leu Tyr Trp Val Ala Gin Gly Thr Leu 
75 80 85 

TTC TGG GCC ATC TTC GTT CTT GGC CAC GAC TGT GGA CAT GGG AGT TTC 
Phe Trp Ala He Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe 
®° 95 100 105 

TCA GAC ATT CCT CTG CTG AAC ACT GTG GTT GGT CAC ATT CTT CAT TCA 
ser Asp He Pro Leu Leu Asn Ser Val Val Gly His He Leu His Ser 
110 115 

TTC ATC CTC GTT CCT TAC CAT GGT TGG AGA ATA AGC CAT CGG ACA CAC 
Phe He Leu Val Pro Tyr His Gly Trp Arg He Ser His Arg Thr His 
125 130 135 

CAC CAG AAC CAT GGC CAT GTT GAA AAC GAC GAG TCT TGG GTT CCG TTG 
HXB Gin Asn His Gly His Val Glu Asn Asp Glu Ser Trp Val Pro Leu 
1*0 145 ISO 

CCA GAA AAG TTG TAC AAG AAC TTG CCC CAT AGT ACT CGG ATG CTC AGA 
Pro Glu Lys Leu Tyr Lys Asn Leu Pro His Ser Thr Arg Met Leu Arc 
155 160 165 

TAC ACT GTC CCT CTG CCC ATG CTC GCT TAC CCG ATC TAT CTG TGG TAC 
Tyr Thr Val Pro Leu Pro Met Leu Ala Tyr Pro He Tyr Leu Trp Tyr 

"5 180 ^ xls 

AGA AGT CCT GGA AAA GAA GGG TCA CAT TTT AAC CCA TAC AGT AGT TTA 
Arg Ser Pro Gly Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu 
1^0 195 200 

TTT GCT CCA AGC GAG AGG AAG CTT ATT GCA ACT TCA ACT ACT TGC TGG 
Phe Ala Pro Ser Glu Arg Lys Leu He Ala Thr Ser Thr Thr Cys Trp 
205 210 215 

TCC ATA ATG TTG GCC ACT CTT GTT TAT CTA TCG TTC CTC GTT GAT CCA 



209 



257 



305 



353 



401 



449 



497 



545 



593 



641 



689 



737 



785 
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Ser lie Met Leu Ala Thr Lieu Val Tyr L/eu Ser Phe Z«eu Val Asp Pro 
220 225 230 

GTC ACA GTT CTC AAA GTC TAT GGC GTT COT TAG ATT ATC TTT GTG ATG 833 
Val Thr Val Leu Lye Val Tyr Gly Val Pro Tyr He lie Phe Val Met 
235 240 245 

TGG TTG GAG GOT GTC ACG TAG TTG CAT CAT CAT GGT CAC GAT GAG AAG ' 881 

Trp Leu Asp Ala Val Thr Tyr Leu His His Hie Gly His Asp Glu Lys 
250 255 260 265 

TTG CCT TGG TAG AGA GGC AAG GAA TGG AGT TAT TTA CGT GGA GGA TTA 929 
Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu 
270 275 280 

ACA ACT ATT GAT AGA GAT TAG GGA ATC TTC AAC AAC ATC CAT CAC GAC . 977 
Thr Thr He Asp Arg Asp Tyr Gly He Phe Asn Asn He His His Asp 
285 290 295 

ATT GGA ACT CAC GTG ATC CAT CAT CTT TTC CCA CAA ATC CCT CAC TAT 1025 
He Gly Thr His Val He His His Leu Phe Pro Gin He Pro His Tyr 
300 305 310 

CAC TTG GTC GAT GCC ACG AGA GCA GCT AAA CAT GTG TTA GGA AGA TAG 1073 
His Leu Val Asp Ala Thr Arg Ala Ala Lys His Val Leu Gly Arg Tyr 
315 320 325 

TAG' AGA -GAG COG AAG ACG TCA GGA GCA ATA CCG ATT CAC TTG GTG GAG 1121 
Tyr Arg Glu Pro Lys Thr Ser Gly Ala lie Pro He His Leu Val Glu 
330 335 340 345 

AGT TTG GTC GCA AGT ATT AAA AAA GAT CAT TAG GTC AGT GAC ACT GGT .1169 
Ser Leu Val Ala Ser He Lys Lys Asp His Tyr Val Ser Asp Thr Gly 
350 355 360 

GAT ATT GTC TTC TAC GAG ACA GAT CCA GAT CTC TAC GTT TAT GCT TCT 1217 
Asp He Val Phe Tyr Glu Thr Asp Pro Asp Leu Tyr Val Tyr Ala Ser 
365 370 375 

GAC AAA TCT AAA ATC AAT TAACTTTTCT TCCTAGCTCT ATTAGGAATA 1265 
Asp Lys Ser Lys He Asn 
380 

AACACTCCTT CTCTTTTACT TATTTGTTTC TGCTTTAAGT TTAAAATGTA CTCGTGAAAC 1325 
CTTTTTTTTA TTAATGTATT ' TAOGTTAC 1353 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: protein 
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(xl) SEQUENCE DESCRIPTION) SEQ ID NOs2t 

Met Val Val Ala Met Asp Gin Arg Ser Asn Val Asn Gly Asp Ser Gly 
■ ^ 5 10 15 

Ala Arg Lys Glu Glu Gly Phe Asp Pro Ser Ala Gin Pro Pro Phe Lys 
20 25 30 

He Gly Asp He Arg Ala Ala He Pro Lys His Cys Trp Val Lys Ser 
35 40 45 

Pro Leu Arg Ser Met Ser Tyr Val Thr Arg Asp He Phe Ala Val Ala 
50 55 60 

Ala Leu Ala Met Ala Ala Val Tyr Phe Asp Ser Trp Phe Leu Trp Pro 
€5 70 75 80 

Leu Tyr Trp Val Ala Gin Gly Thr Leu Phe Trp Ala He Phe Val Leu 
65 90 95 

Gly His Asp Cys Gly His Gly Ser Phe Ser Asp He Pro Leu Leu Asn 
100 105 110 

Ser Val Val Gly His lie Leu His Ser Phe He Leu Val Pro Tyr His 
115 120 125 

Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly His Val 
130 135 140 

Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Lys Leu Tyr Lys Asn 
"5 150 155 160 

Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro Leu Pro Met 
165 170 175 

Leu Ala Tyr Pro He Tyr Leu Trp Tyr Arg Ser Pro Gly Lys Glu Gly 
180 185 190 

Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser Glu Arg Lys 
195 200 205 

Leu He Ala Thr Ser Thr Thr Cys Trp Ser He Met Leu Ala Thr Leu 
210 215 220 

Val Tyr Leu Ser Phe Leu Val Asp Pro Val Thr Val Leu Lys Val Tyr 
225 230 235 240 

Gly Val Pro Tyr He He Phe Val Met Trp Leu Asp Ala Val Thr Tyr 
245 250 255 

Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr Arg Gly Lys 
260 265 270 

Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr He Asp Arg Asp Tyr 
275 280 285 
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Gly ZXe Phe Aan Aan lie His Hie Asp lie Gly Thr Hie Val lie Hie 
290 295 300 

Hie I.eu Phe Pro Gin lie Pro His Tyr His I.eu Val Asp Ala Thr Arg 
305 310 315 320 

Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro Lys Thr Ser 
325 330 335 

Gly Ala lie Pro lie His Leu Val Glu Ser Leu Val Ala Ser lie Lys 
340 345 350 

Lys Asp His Tyr Val Ser Asp Thr Gly Asp lie Val Phe Tyr Glu Thr 
355 360 365 

Asp Pro Asp Leu Tyr Val Tyr Ala Ser Asp Lys Ser Lys He Asn 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 
. (B) TYPE: nucleic acid- 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 

(ii) -MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
GGCGATGCTG TOGGAATGGA OGATA 25 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTTGGAGCCA CTATCGACTA CGCGATC 27 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOsS: 
CC6ATCTCAA GATTACGGAA T 
(2) INFORMATION FOR SEQ ID NOtSt 

(i) SEQXra:NCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTCCTAAT6C A66A6TCGCA TAAG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGGAGTCGCA TAAGGGAG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



SUBSTITUTE SHEET (RULE 26) 



wo 94/18337 PCT/US94/01321 

-75- 

GGGAAGTGAA TGGA6AC 17 
(2) INFORMATION FOR SEQ ID NO:9s 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 1645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 125. .1465 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



6GAAAACACA AGTTTCTCTC ACACACATTA TCTCTTTCTC TATTACCACC ACTCATTCAT 60 

AACA6AAACC CACCAAAAAA TAAAAAGAGA GACTTTTCAC TCTGGGGA6A GAGCTCAAGT 120 

TCTA ATG GCG AAC TTG GTC TTA TCA GAA TGT GGT ATA CGA CCT CTC CCC 169 
Me^ Ala Asn teu Val Leu Ser Glu Cye Gly lie Arg Pro Leu Pro 
1 5 10 ^5 

AGA ATC TAC ACA ACA CCC A6A TCC AAT TTC CTC TCC AAC AAC AAC AAA 217 
Arg lie Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Aen Ash Lye 
20 25 30 

TTC AGA CCA TCA CTT TCT TCT TCT TCT TAC AAA ACA TCA TCA TCT CCT 265 
Phe Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lye Thr Ser Ser Ser Pro 
35 40 45 

CTG TCT TTT GGT CTG AAT TCA CGA GAT GGG TTC ACG A6G AAT TGG GCG 313 
Leu Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arig Asn Trp Ala 
50 55 60 

TTG AAT GTG AGC ACA CCA TTA ACG ACA CCA ATA TTT GAG GAG TCT CCA 361 
Leu Asn Val Ser Thr Pro Leu Thr Thr Pro lie Phe Glu Glu Ser Pro 
65 70 75 

TTG GAG GAA GAT AAT AAA CAG AGA TTC GAT CCA GGT GCG CCT CCT CCG 409 
Leu Glu Glu Asp Asn Lys Gin Arg Phe Asp Pro Gly Ala Pro Pro Pro 
80 85 90 95 

TTC AAT TTA GCT GAT ATT AGA GCA GCT ATA CCT AAG CAT TGT TGG GTT 457 
Phe Asn Leu Ala Asp lie Arg Ala Ala lie Pro Lys His Cys Trp Val 
100 105 110 

AAG AAT CCA TGG AAG TCT TTG AGT TAT GTC GTC AGA GAC GTC GCT ATC 505 
Lys Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala lie 
il5 120 125 
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553 



GTC TTT GCA TTG GCT GCT GGA OCT 6CT TAC CTC AAC AAT TGG ATT GTT 
Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Aan Trp lie Val 
130 135 140 



TGG CCT CTC TAT TGG CTC GCT CAA GGA ACC ATG TTT TGG GCT CTC TTT 
Trp Pro Leu Tyr Trp Leu Ala Gin Gly Thr Met Phe Trp Ala Leu Phe 
145 150 155 



601 



697 



GTT CTT GGT CAT GAC TGT GGA CAT GGT AGT TTC TCA AAT GAT CCG AAG 649 
Val Leu Gly Hie Asp Cyo Gly His Gly Ser Phe Ser Asn Aep Pro Lys 
1" 165 170 175 

TTG AAC AGT GTG GTC GCT CAT CTT CTT CAT TCC TCA ATT CTG GTC CCA 
Leu Asn Ser Val Val Gly His Leu Leu His Ser Ser lie Leu Val Pro 
180 185 190 

TAC CAT GGC TGG AGA ATT AGT CAC A6A ACT CAC CAC CAG AAC CAT GGA 745 
Tyr His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly 
195 200 205 

CAT GTT GAG AAT GAC GAA TCT TGG CAT CCT ATG TCT GAG AAA ATC TAC 793 
His Val Clu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys He Tyr 
210 215 220 



841 



889 



AAT ACT TTG GAC AAG CCG ACT AGA TTC TTT AGA TTT ACA CTG CCT CTC 
Asn Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu 
225 230 235 

GTG ATG CTT GCA TAC CCT TTC TAC TTG TGG GCT CGA AGT CCG GGG AAA 
Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys 

245 250 255 

AAG GGT TCT CAT TAC CAT CCA GAC AGT GAC TTG TTC CTC CCT AAA GAG 937 
Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu 
260 265 270 

AGA AAG GAT GTC CTC ACT TCT ACT GCT TGT TGG ACT GCA ATG GCT GCT 
Arg Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala 
275 280 285 



985 



1033 



1081 



CTG CTT GTT TGT CTC AAC TTC ACA ATC GGT CCA ATT CAA ATG CTC AAA 
Leu Leu Val Cys Leu Asn Phe Thr He Gly Pro He Gin Met Leu Lys 
290 295 300 

CTT TAT GGA ATT CCT TAC TGG ATA AAT GTA ATG TGG TTG GAC TTT GTG 
Leu Tyr Gly He Pro Tyr Trp He Asn Val Met Trp Leu Asp Phe Val 
305 310 315 

ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG CTT CCT TGG TAC CGT 1129 
Thr Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arc 
320 325 330 335 

GGC AAG GAG TGG AGT TAC CTG AGA GGA GGA CTT ACA ACA TTG GAT CGT 1177 
Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg 
340 345 350 
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GAC TAC GGA TTG ATC AAT AAC ATC CAT CAT GAT ATT GGA ACT CAT GTG 1225 
Asp Tyr Gly Leu lie Aen Asn He Hie Hie Asp He Gly Thr His Val 
355 360 365 

ATA CAT CAT CTT TTC CC6 CAG ATC CCA CAT TAT CAT CTA 6TA GAA GCA 1273 
He His His Leu Phe Pro Gin He Pro His Tyr His Leu Val 61u Ala 
370 375 380 

ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT TAC AGG GAG CCT GAT 1321 
Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp 
385 390 395 

AAG TCT GGA CCG TTG CCA TTA CAT TTA CTG GAA ATT CTA GCG AAA AGT 1369 
Lys Ser Gly Pro Leu Pro Leu His Leu Leu Glu He Leu Ala Lys Ser 
400 405 410 415 

ATA AAA GAA GAT CAT TAC GTG AGC GAC GAA GGA GAA GTT GTA TAC TAT 1417 
He Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr 
420 425 430 

AAA GCA GAT CCA AAT CTC TAT GGA GAG GTC AAA GTA AGA GCA GAT TGAAATGAAG 
1472 

Lys Ala Asp Pro Asn Leu Tyr- Gly Glu Val . Lys Val Arg Ala Asp 

435 440 445 

CAGGCTTGAG- ATTGAAGTTT* TTTCTATTTC AGAC.CAGCTG ATTTTTTGCT TACTGTATCA 1532 

ATTTATTGTG TCACCCACCA GAGA6TTAGT ATCTCTGAAT ACGATCGATC AGATGGAAAC 1592 

AACAAATTTG^TTTGCGATAC T6AAGCTATA TATACCATAA AAAAAAAAAA AAA 1645 



(2) INFORHATZON FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Asn Leu Val Leu Ser Glu Cys Gly He Arg Pro Leu Pro Arg 
1 5 10 15 

He Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys Phe 
20 25 30 

Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro Leu 
35 40 45 

Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Ala Leu 
50 55 60 

Asn Val Ser Thr Pro Leu Thr Thr Pro He Phe Glu Glu Ser Pro Leu 
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65 70 75 80 

Glu Glu Asp Asn Lys Gin Arg Phe Asp Pro Gly Ala Pro Pro Pro Phe 
85 90 95 

Asn Leu Ala Asp lie Arg Ala Ala He Pro Lys Bis Cys Trp Val Lys 
100 105 110 

Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala He Val 
115 120 125 

Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp He Val Trp 
130 135 140 

Pro Leu Tyr Trp Leu Ala Gin Gly Thr Met Phe Trp Ala Leu Phe Val 
"5 150 155 160 

Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys Leu 
165 170 175 

Asn Ser Val Val Gly His Leu Leu His Ser Ser He Leu Val Pro Tyr 
IBO 185 190 

His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly His 
i95 200 205 

Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys He Tyr Asn 
210 215 220 

Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu Val 
225 230 235 240 

Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys Lys 
245 250 255 

Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu Arg 
260 265 270 

Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala Leu 
275 280 285 

Leu Val Cys Leu Asn Phe Thr He GXy Pro He Gin Met Leu Lys Leu 
290 295 300 

Tyr Gly He Pro Tyr Trp He Asn Val Met Trp Leu Asp Phe Val Thr 
305 310 315 320 

Tyr Leu His His Hie Cly His Glu Asp Lys Leu Pro Trp Tyr Arg Gly 
325 330 335 

Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg Asp 
340 345 350 

Tyr Gly Leu He Asn Asn He His His Asp He Gly Thr His Val He 
355 360 365 
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His His Leu Phe Pro Gin lie Pro His Tyr Hi8 Leu Val Glu Ala Thr 
370 375 380 

Glu Ala Ala Lye Pro Val Leu Gly Lye Tyr Tyr Arg Glu Pro Aep Lye 
365 390 395 400 

Ser Gly Pro Leu Pro Leu Hie Leu Leu Glu lie Leu Ala Lye Ser lie 
405 410 415 

Lye Glu Aep His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr Lye 
420 425 430 

Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala Asp 
435 440 445 



(2) INFOHMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B> LOCATION: 61. « 1368 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGAGAGTGCA AATAGAACGA CAGAGACTTT TTCCTCTTTT CTTCTTGGGA AGAGGCTCCA 60 

AT6 GCG A6C TCG GTT TTA TCA 6AA TGT GGT TTT AGA CCT CTC CCC AGA 108 
Met Ala Ser Ser Val Leu Ser Glu Cys Gly Phe Arg Pro X«eu Pro Arg 
15 10 15 

TTC TAC CCT AAA CAC ACA ACC TCT TTT GCC TCT AAC CCT AAA CCC ACT 156 
Phe Tyr Pro Lys His Thr Thr Ser Phe Ala Ser Asn Pro Lys Pro Thr 
20 25 . 30 

TTC AAA TTC AAT CCA CCA CTT AAA CCT CCT TCT TCT CTT CTC AAT TCC 204 
Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser 
35 40 45 

CGA TAT GGA TTC TAC TCT AAA ACC AGG AAC TGG GCA TTG AAT GTG GCA 252 
Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Ala Leu Asn Val Ala 
50 55 60 



ACA CCT TTA ACA ACT CTT CAG TCT CCA TCC GAG GAA GAC ACG GAG AGA 300 
Thr Pro Leu Thr Thr Leu Gin Ser Pro Ser Glu Glu Asp Thr Glu Arg 
65 70 75 80 
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TTC GAC CCA GGT GC6 OCT CCT CCC TTC AAT TTG GCG GAT ATA AGA GCA 348 
Phe Asp Pro Gly AXa Pro Pro Pro Phe Aen Leu Ala Asp iXe Arg Ala 
85 90 95 

6CC ATA CCT AAG CAT TGT TGG GTT AAG AAT CCA TGG ATG TCT ATG A6T 396 
Ala He Pro Lys His Cys Trp Val Lye Asn Pro Trp Net Ser Met Ser 
100 105 110 

TAT GTT GTC AGA GAT GTT GCT ATG GTC TTT GGA TTG GCT GCT GTT GCT 444 
Tyr Val Val Arg Asp Val Ala He Val Phe Gly Leu Ala Ala Val Ala 
115 120 125 

GCT TAG TTC AAC AAT TGG CTT CTC TGG CCT CTC TAG TGG TTC GCT CAA 492 
Ala Tyr Phe Aan Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gin 
130 135 140 

GGA ACC ATG TTC TGG GCT CTC TTT GTC CTT GGC CAT GAC TGC GGA CAT 540 
Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly Hie Asp Cys Gly His 
145 150 155 160 

GGT AGC TTC TCG AAT GAT CCG AGG CTG AAC AGT GTG GCT GGT CAT CTT 588 
Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Ala Gly His Leu 
165 170 175 

CTT CAT TCC TCA ATT CTG GTC CCT TAC CAT GGC TGG AGG ATT AGC CAC 636 
Leu His Ser Ser He Leu Val Pro Tyr His Gly Trp Arg He Ser Bis 
180 185 190 

AGA ACT CAC CAC GAG AAC CAT GGT CAT GTC GAG AAT GAC GAA TCA TGG 684 
Arg Thr His His Gin Asn His Gly His Val Glu Asn Asp Glu Ser Trp 
195 200 205 

CAT CCT TTG CCT GAA AGC ATC TAC AAG AAT TTG GAA AAG ACG ACT CAA 732 
His Pro Leu Pro Glu Ser He Tyr Lys Asn Mu Glu Lys Thr Thr Gin 
210 215 220 

ATG TTT AGG TTT ACA CTG CCT TTT CCA ATG CTC GCA TAC CCT TTC TAC 780 
Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Ala Tyr Pro Phe Tyr 
225 230 235 240 

TTG TGG AAC AGA AGT CCA GGG AAA CAA GGT TCT CAT TAT CAT CCG GAC 828 
Leu Trp Asn Arg Ser Pro Gly Lys Gin Gly Ser His Tyr His Pro Asp 
245 250 255 

AGT GAC TTG TTT CTT CCA AAA GAG AAG AAA GAT GTT CTG ACA TCA ACT 876 
Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Val Leu Thr Ser Thr 
260 265 270 

GCC TGT TGG ACT GCA ATG GCT GCT TTG CTT GTT TGT CTC AAC TTT GTC 924 
Ala Cys Trp Thr Ala Met Ala Ala Leu Leu Val Cys Leu Asn Phe Val 
275 280 285 

ATG GGT CCA ATC CAG ATG CTC AAA CTA TAT GGC ATC CCT TAT TGG ATA 972 
Met Gly Pro He Gin Met Leu Lys Leu Tyr Gly He Pro Tyr Trp He 
290 295 300 
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TTT GTA ATG TGG TTG GAC TTC GTC ACT TAC TTG CAC CAC CAT GGA CAT 1020 
Phe Val Met Trp Leu Asp Phe Val Thr Tyr Leu Hia Hie Hie Gly Hie 
305 310 315 320 

GAA GAC AAG CTC CCT TGG TAT CGT GGA AAG GAA TGG AGT TAC CTG AGA 1068 
Glu Aep Lye Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg 
325 330 335 

GGA GGG CTC ACA ACA TTA GAT CGT GAC TAC GGA TGG ATC AAT AAC ATC 1116 
Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp lie Asn Asn lie 
340 345 350 

CAC CAC GAT ATT GGA ACT CAT GTG ATA CAT CAT CTT TTC CCG CAG ATC 1164 
His His Asp lie Gly Thr His Val lie His His Leu Phe Pro Gin lie 
355 360 365 

CCA CAT TAT CAT CTA GTA GAA GCA ACA GAA GCA GCT AAA CCA GTA CTA 1212 
Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu 
370 375 380 

GGA AAG TAC TAC AGA GAA CCG AAA AAC TCT GGA CCT CTG CCA CTT CAC 1260 
Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His 
385 . 390 .. .. 395 400 

TTA CTG GGA AGC CTC ATA AAG AGT ATG AAA CAA GAC CAT TTC GTA AGC 1308 
Leu Leu Gly Ser Leu lie Lys Ser Met; Lys Gin Asp His Phe Val Ser 
405 410 415 

GAT ACA GGA GAT GTC GTG TAC TAT GAG GCA GAT CCA AAA CTC AAT GGA 1356 
Asp Thr Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Lys I^u Asn Gly 
420 425 430 

CAA AGA ACA TGAGGACATA CTGCAGTGAA CCAGGCAGAC AAGTTACATA 1405 
Gin Arg Thr 
435 

AATTCATCTT GGCCCATTCA TTATGTTCTT TTTGTTTTGG TGTAAAGCCT TTTCGAGATT 1465 
AAAAAAGCAT TAATTTGTAG AAACCTGTGG TAAAACTCTC GATCAAATGA AATAAGATAT 1525 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 435 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala ser Ser Val Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg 
15 10 15 
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Phe Tyx Pro l»yB Hie Thr Thr Ser Phe Ala Ser Asn Pro Liye Pro Thr 
20 25 30 

Phe Lye Phe Asn Pro Pro Leu Lye Pro Pro Ser Ser Leu heu Aan Ser 
35 40 45 

Arg Tyr Gly Phe Tyr Ser Lye Thr Arg Aen Trp Ala Leu Aan Val Ala 
50 55 60 

Thr Pro I«eu Thr Thr Leu Gin Ser Pro Ser Glu Glu Aap Thr Glu Arg 
65 70 75 80 

Phe Aap Pro Gly Ala Pro Pro Pro Phe Aan Leu Ala Aap lie Arg Ala 
85 90 95 

Ala lie Pro Lya Hie Cys Trp Val Lya Asn Pro Trp Met Ser Met Ser 
100 105 110 

Tyr Val Val Arg Aap Val Ala lie Val Phe Gly Leu Ala Ala Val Ala 
115 120 125 

Ala Tyr Phe Aan Aan Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gin 
130 135 140 

Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly Hia Aap Cya Gly Hie 
145 150 155 160 

Gly Ser Phe Ser Aan Aap Pro Arg Leu Aan Ser Val Ala Gly Hia Iau 
165 170 . 175 

Leu Hia Ser Ser lie Leu Val Pro Tyr Hia Gly Trp Arg lie Ser Hia 
180 185 190 

Arg Thr Hia Hia Gin Aan Hia Gly Hia Val Glu Aan Aap Glu Ser Trp 
195 200 205 

Hia Pro Leu Pro Glu Ser' lie Tyr Lya Aan Leu Glu Lya Thr Thr Gin 
210 215 220 

Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Ala Tyr Pro Phe Tyr 
225 230 235 240 

Leu Trp Aan Arg Ser Pro Gly Lya Gin Gly Ser Hia Tyr His Pro Aap 
245 250 255 

Ser Aap Leu Phe Leu Pro Lya Glu Lya Lya Aap Val Leu Thr Ser Thr 
260 265 270 

Ala Cya Tzp Thr Ala Met Ala Ala Leu Leu Val Cya Leu Aan Phe Val 
275 280 285 

Met Gly Pro lie Gin Met Leu Lya Leu Tyr Gly lie Pro Tyr Trp lie 
290 295 300 

Phe Val Met Trp Leu Asp Phe Val Thr Tyr Leu Hia Hia Hia Gly Hia 
305 310 315 320 
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Glu Asp Lys I«eu Pro Trp Tyr Arg Gly Xys Glu Trp Ser Tyr Vbm Arg 
325 330 335 

Gly Gly Leu Thr Thr Leu Aep Arg Aap Tyr Gly Trp lie Asn Asn lie 
340 345 350 

His Hi.8 Asp lie Gly Thr His Val lie His His Leu Phe Pro Gin lie 
355 360 365 

Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu 
370 375 380 

Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro I«eu His 
385 390 395 400 

Leu Leu Gly Ser Leu lie Lys Ser Met Lys Gin Asp His Phe Val Ser 
405 410 415 

Asp Thr Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Lys Leu Asn Gly 
420 425 430 

Gin Arg Thr 
435 



(2) INFORMATION FOR -SEQ ID NO: 13: - 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
< B ) -TYPE : nucleic acid - - 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAYATHMGNG CNGCNATHCC 20 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCNATHCCNA ARCAYTG 17 



SUBSTITUTE SHEET (RULE 26) 



wo 94/18337 



PCT/US94/01321 



-84- 

(2) INFORMATION FOR SEQ ID NOslSs 

(i) SEQUENCE CHARACTERISTICS: 

(A) X«£NGTHs 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECOXiE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AARCAYT6YT GGGTNAA 17 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIXXSY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGGTTYYTNT GGCCNYTNTA YTGG 24 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGGTTYYTNT GGCCN 15 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MC^CULE TYPE: DNA (synthetic) 



<xl) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TG6CCNYTNT AYTGG 15 
(2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ( synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TGGGTNGCNC ARGGNAC 17 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (synthetic) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTYGTNYTNG GNCAYGA 17 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTNYTNGGNC AYGAYTG 17 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



GGNCAYGAYT GY6GNCA 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE I nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



17 



(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
TGYGGNCAYG GNWSNTT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCNTAYCAYG GNTGG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPK: DNA (synthetic) 



(xl) SEQXJENCE DESCRIPTION: SEQ IV NOs25: 
CAYGGNTGGM 6NATHWSNCA 20 
(2) INFORMATION FOR SEQ ID NO: 26: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (synthetic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TGGMGNATHT CNCAYM6NAC NCAYCA "26 
(2) INFORMATION FOR SEQ ID NO:27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid . ^ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (synthetic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
TGGMGNATHA GYCAYMGNAC NCAYCA 26 
(2) INFORMATION FOR SEQ ID NO: 28: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (synthetic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
TGGMGNATHW SNCAY 15 
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(2) INFORMATION FOR SEQ ID NOs29: 

(1) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 15 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CAYKGNACNC AYCAY 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GARAAYGAYG ARWSNTG6 
(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:. 
6AYGARWSNT GGGTNCC 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPES DMA (Bynthetic) 



PCT/OS94/01321 
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(xi) SEQUENCE DESCRIPTZON: SEQ ZD NO:32t 
NGTNACNGCR TCNARCCA 
(2) INFORMATION FOR SEQ ID NO: 33s 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
RT6RT6NARR TANGT 15 
(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
ARNCCNCCNC KNARRTARCT CCA 23 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ARNCCNCCNC KNARRTANGA CCA 23 
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(2) INFORMATION FOR SEQ ZD NO: 36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
RTCNCKRTCD ATNGTNGTNA 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
RTARTCNCKR TCDATNGT 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
RTGNGTNCCD ATRTCRTG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DHA <Bynthetie) 



<xi) SEQUENCE DESCRIPTIONS SEQ ID N08 39l 
NARRTGRTGD ATNACRTG 
(2) INFORMATION FOR SEQ ID NO s 40s 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 21 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOs40: 
DATYTGNG6R AANARRT6RT 6 . ^ 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 20 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOs41: 
GGDATYTGNG 6RAANARRTG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
RTARTGNGGD ATYTGNG6RA ANA 
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(2) INFORMHTION FOR SEQ ID N08 43s 

(1) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO s 43 s 

Asp lie Arg Ala Ala lie Pro 

1 5 

(2) INFORMATION FOR SEQ ID NOs44s 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ala lie Pro Lye His Cys 

1 5 . 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Lys His Cys Trp Val Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Trp Phe Leu Trp Pro Leu Tyr Trp 
1 5 

(2) INFORMATION FOR SEQ ID NOs47: 

(i) SEQXIENCE CHARACTERISTICS s 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Trp Phe Leu Trp Pro 
i 5 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid — 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Trp Pro Leu Tyr Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Trp Val Ala Gin Gly Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 
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(1) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOsSO: 

Trp Val Ala Gin Gly Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: Sis 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Val Leu Gly His Asp Cys 
1 S 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly His Asp Cys Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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<xi) SEQUENCS DESCRIPTIONS SEQ ID MO: 53: 

Cys Gly HlB Gly Ser Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Pro Tyr Hie Gly Trp 
15 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

His Gly Trp Arg lie. Ser His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Trp Arg lie Ser His Arg Thr His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTHS 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

Trp Arg lie Ser His 
1 5 

(2) INFORMATION FOR SEQ ID NO:58: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

His Arg Thr His His 
1 5 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Glu Asn Asp Glu Ser Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Asp Glu Ser Trp Val Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Trp Leu Asp Ala Val Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: S amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Thr Tyr Leu Hie Hie 
1 5 

(2) INFORMATION FQR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Trp Ser Tyr Leu Arg Gly Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) I.EN6TH: 7 amino acids 

(B) TYPES amino acid 
(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Thr Thr He Asp Arg Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Thr He Asp Arg Asp Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

His Asp He Gly Thr His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DBSCRZPTZON: SEQ ZD NO: 67: 

His Val Zle Hie His Leu 
1 5 

(2) ZNFORHATZON FOR SEQ ZD NO: 68: 

(i) SEQUENCE CHARACTERZSTZCS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TTPE: peptide 



(xi) SEQUENCE DESCRIPTZON: SEQ ZD NO: 68: 

His His Leu Phe Pro Gin Zle 
1 5 

(2) ZNFORMATZON FOR SEQ ZD NO: 69: 

(i) SEQUENCE CHARACTERZSTZCS: 
<A) LENGTH: 7 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRZPTZON: SEQ ZD NO: 69: 

His I«eu Phe Pro Gin Zle Pro 
1 5 

(2) ZNFORMATZON FOR SEQ ZD NO: 70: 

(i) SEQUENCE CHARACTERZSTZCS: 

(A) liENGTK: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRZPTZON: SEQ ZD NO: 70: 

Leu Phe Pro Gin Zle Pro His Tyr 
1 5 

(2) ZNFORMATZON FOR SEQ ZD NO: 71: 

( i ) SEQUENCE CHARACTERZSTZCS : 
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(A) I^NGTH: 1670 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEONSSSs double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 46.. 1302 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CAAACTCTCT CGGGGGGTOG CTTCTTCTGC ATTTTCTGCT TCCCA ATG GCT TCC 54 

Met Ala Ser 
1 

AGA ATT GCT GAT TCT CTC TTC GCC TTC ACG GGC CCA CAG CAA TGT CTT 102 
Arg He Ala Asp Ser Leu Phe Ala Phe Thr Gly Pro Gin Gin Cys Leu 
5 10 15 

CCT AGG GTT CCT AAG CTT GCT GCT TCT TCT QCT CGT GTT TCT CCT GGT 150 
Pro Arg Val Pro Lys Leu Ala Ala Ser Ser Ala Arg Val Ser Pro Gly 
20 25 30 35 

GTA TAT GCT GTG MiG CCG ATT GAT CTT CTG TTA AAA GGA CGA ACT CAT 198 
Val Tyr Ala Val Lye Pro lie Asp Leu Leu Leu Lys Gly Arg Thr Hia 
40 45 50 

CGA AGT AGA AGA TGT GTA GCT CCT GTG AAA AGG AGA ATT GGA TGT ATC 246 
Arg Ser Arg Arg Cys Val Ala Pro Val Lys Arg Arg lie Gly Cys lie 
55 60 65 

AAA GCG GTG GCT GCT CCA GTT GCA CCG CCT TCA GCT GAC AGT GCA GAA 294 
Lys Ala Val Ala Ala Pro Val Ala Pro Pro Ser Ala Asp Ser Ala Glu 
70 75 80 

GAC AGG GAA CAG TTA GCA GAA AGC TAT GGA TTC AGA CAA ATT GGA GAA 342 
Asp Arg Glu Gin Leu Ala Glu Ser Tyr Gly Phe Arg Gin lie Gly Glu 
85 90 95 

GAT CTT CCT GAG AAT GTC ACC TTA AAA GAT ATC ATG GAT ACA CTT CCC 390 
Asp Leu Pro Glu Asn Val Thr Leu Lys Asp lie Met Asp Thr Leu Pro 
100 105 110 115 

AAA GAG GTG TTT GAG ATT GAT GAT CTG AAA GCT TTC AAG TCT GTG TTG 438 
Lys Glu Val Phe Glu lie Asp Asp Leu Lys Ala Leu Lys Ser Val Leu 
120 125 130 

ATA TCT GTG ACT TCA TAC ACT TTG GGG CTC TTC ATG ATT GCA AAA TCG 486 
lie Ser Val Thr Ser Tyr Thr Leu Gly Leu Phe Met lie Ala Lys Ser 
135 140 145 

CCG TGG TAT CTG CTA CCG TTG GCT TGC GCA TGG ACA GGA ACT OCA ATT 534 
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Pro Trp Tyr Z«eu Leu Pro Leu Ala Trp Ala Trp Thr Gly Thr Ala lie 
ISO 155 160 

ACC GGG TTC TTT GTG ATA GGT CAT GAT TGT GCA CAT AAG TCA TTT TCA 582 
Thr Gly Phe Phe Val lie Gly Hie Asp Cye Ala Hie Lye Ser Phe Ser 
165 170 175 

AAG AAC AAA TTG GTG GAA GAC ATT GTG GGT ACT CTC 6CC TTC CTA CCA 630 
Lye Aen Lye Leu Val Glu Asp lie Val Gly Thr Leu Ala Phe Leu Pro 
180 185 190 195 

CTT GTC TAC CCA TAT GAG CCA TGG CGG TTT AAG CAC GAC CGC CAT CAC 678 
Leu Val Tyr Pro Tyr Glu Pro Trp Arg Phe Lye Hie Aep Arg Hie His 
200 205 210 

GCC AAA ACC AAC ATG TTA CTT CAT GAC ACA GCT TGG CAG CCA GTT CCG 726 
Ala Lys Thr Asn Met Leu Leu His Asp Thr Ala Trp Gin Pro Val Pro 
215 220 225 

CCA GAG GAG TTT GAG TCA TCA CCC GTG ATG AGA AAG GCA ATC ATT TTT 774 
Pro Glu Glu Phe Glu Ser Ser Pro Val Met Arg Lys Ala lie Xle Phe 
230 235 240 

GGA TAT GGC CCA ATT AGA CCT TGG TTG TCC ATA GCT CAC TGG GTG AAC 822". 
Gly Tyr Gly Pro lie Arg Pro Trp Leu Ser lie Ala His Trp Val Asn 
245 250 255 

TGG CAC TTC AAT CTG AAA AAG TTC AGA GCG A6C GAG GTG AAT AGG GTG 870 
Trp His Phe Asn Leu Lye Lys Phe Arg Ala Ser Glu Val Aen Arg Val' - 
260 265 270 275 

AAG ATA AGT TTG GCT TGT GTT TTC GCC TTC ATG GCC GTT GQG TGG CCA 918 
Lys lie Ser Leu Ala Cys Val Phe Ala Phe Met Ala Val Gly Trp Pro 
280 285 290 

CTG ATC GTA TAC AAA GTT GGT ATA TTG GGA TGG GTA AAA TTC TGG TTA 966 
Leu He Val Tyr Lys Val Gly He Leu Gly Trp Val Lys Phe Trp Leu 
295 300 305 

ATG CCA TGG TTG GGC TAT CAC TTC TGG ATG AGC ACA TTC ACA ATG GTT 1014 
Met Pro Trp Leu Gly Tyr His Phe Trp Met Ser Thr Phe Thr Met Val 
310 315 320 

CAT CAT ACG GCT CCG CAT ATA CCT TTC AAG CCT GCG GAT GAG TGG AAC 1062 
His His Thr Ala Pro His He Pro Phe Lys Pro Ala Asp Glu Trp Asn 
325 330 335 

GCG GCT CAG GCC CAG CTG AAT GGA ACT GTT CAT TGT GAC TAC CCT AGT 1110 
Ala Ala Gin Ala Gin Leu Asn Gly Thr Val His Cys Asp Tyr Pro Ser 
340 345 350 355 

TGG ATT GAA ATT CTC TGC CAT GAT ATC AAC GTT CAC ATC CCG CAT CAT 1158 
Trp He Glu He Leu Cys His Asp He Asn Val His He Pro His His 
360 365 370 

ATT AGC CCA ACA ATA CCG AGC TAC AAT CTC CGT GCA GCT CAT GAG TCT 1206 
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Ile Ser Pro Arg lie Pro Ser Tyr Asn I«eu Arg Ala Ala Hie Glu Ser 
375 380 385 

ATA CAA GAG AAC TGG GGA AAG TAT ACA AAC TTG GCT ACA TGG AAC TGG 1254 
lie Gin Glu Asn Trp Gly Lye Tyr Thr Asn Leu Ala Thr Trp Aen Trp 
390 395 400 

CGA TTG ATG AAG ACG ATA ATG ACT GTG TGT CAT GTC TAT GAC AAA TAGGAGAACT 
1309 

Arg Leu Met Lye Thr lie Met Thr Val Cye His Val Tyr Asp Lys 
405 410 415 

ACATTCCTTT TGACCGGTTA GCCCCTGAAG AATCTCAGCC AATAACCTTC CTCAAGAAAT 1369 

CAATGCCTAA CTACACAGCC TGATTOGCCA TGGTCTCAAA CTAGTCTTTT GAAATCTCAA 1429 

TATCTTTTTG CAGTCGCC6A TGTTATATGT AAGCTTTCCA AGCGATGAGC TTCTCTAACA 1489 

CTTCACCAAC 6CTTTATACT GTTATCTTCT TTCCAATCTT ATCAGAAGAG AGAAACTGGT 1549 

CAAATTATCT GAGCGATT6C AATTCTTTTA TCAGTTTCTT AGCTATAA6A AGATTGAACA 1609 

GTCTATATAG TTTGCAATGT ACTGTAATGT GATGAAAATT TAGTTGATGA GAAAAAAAAA 1669 

^ 1670 

(2) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Ala Ser Arg lie Ala Asp Ser Leu Phe Ala Phe Thr Gly Pro Gin 
^ 5 10 15 

Gin Cys Leu Pro Arg Val Pro Lys Leu Ala Ala Ser Ser Ala Arg Val 
20 25 30 

Ser Pro Gly Val Tyr Ala Val Lys Pro lie Asp Leu Leu Leu Lys Gly 
35 40 45 

Arg Thr His Arg Ser Arg Arg Cys Val Ala Pro Val Lys Arg Arg lie 
50 55 60 

Gly Cys He Lys Ala Val Ala Ala Pro Val Ala Pro Pro Ser Ala Asp 
65 70 75 80 

Ser Ala Glu Asp Arg Glu Gin Leu Ala Glu Ser Tyr Gly Phe Arg Gin 
85 90 95 
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lie Gly Glu Asp heu Pro Glu Asn Val Thr Vbu Lys Asp lie Met Asp 
100 105 110 

Thr Leu Pro Lys Glu Val Phe Glu lie Aep Asp Leu Lys Ala Leu Lys 
115 120 125 

Ser Val Leu lie Ser Val Thr Ser Tyr Thr Leu Gly Leu Phe Met lie 
130 135 140 

Ala Lys Ser Pro Trp Tyr Leu Leu Pro Leu Ala Trp Ala Trp Thr Gly 
145 150 155 160 

Thr Ala lie Thr Gly Phe Phe Val lie Gly Bis Asp Cys Ala His Lys 
165 170 175 

Ser Phe Ser Lys Asn Lys Leu Val Glu Asp Zle Val Gly Thr Leu Ala 
180 185 190 

Phe Leu Pro Leu Val Tyr Pro Tyr Glu Pro Trp Arg Phe Lys His Asp 
195 200 205 

Arg His His Ala Lys Thr Asn Met Leu Leu His Asp Thr Ala trp Gin 
210 215 220 

Pro Val Pro Pro Glu Glu Phe Glu Ser Ser Pro Val Met Arg Lys Ala 
225 230 235 240 

lie lie Phe Gly Tyr Gly Pro lie Arg Pro Trp Leu Ser lie Ala His 

. 245 . 250 255 

Trp Val Asn Trp His Phe Asn Leu Lys Lys Phe Arg Ala Ser Glu Val 
260 265 270 

Asn Arg Val Lys Zle Ser Leu Ala Cys Val Phe Ala Phe Met Ala Val 
275 280 285 

Gly Trp Pro Leu lie Val Tyr Lys Val Gly He Leu Gly Trp Val Lys 
. 290 295 300 

Phe Trp Leu Met Pro Trp Leu Gly Tyr His Phe Trp Met Ser Thr Phe 
305 310 315 320 

Thr Met Val His His Thr Ala Pro His He Pro Phe Lys Pro Ala Asp 
325 330 335 

Glu Trp Asn Ala Ala Gin Ala Gin Leu Asn Gly Thr Val His Cys Asp 
340 345 350 

Tyr Pro Ser Trp He Glu He Leu Cys His Asp He Asn Val His He 
355 360 365 

Pro His His He Ser Pro Arg He Pro Ser Tyr Asn Leu Arg Ala Ala 
370 375 380 

His Glu Ser lie Gin Glu Asn Trp Gly Lys Tyr Thr Asn Leu Ala Thr 
385 390 395 400 
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Trp Asn Trp Arg Leu Met Lys Thr lie Met Thr Val Cys His Val Tyr 
405 410 415 

Asp LyB 
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Claims: 

1. A genetically transformed plant which has an elevated 
linolenic add content comprising a recombinant, double-stranded DNA 

5 molecule comprising 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably linked to; 

(ii) a structxiral coding sequence that causes the 
10 production of an RNA sequence that encodes a linoleic 

acid desaturase activity; and 

(iii) a 3' non-translated region that functions in plant 
cells to promote polyadenylation to the 3' end of said RNA 

- sequence. - 

15 2. The plant of claim 1 in which the linoleic acid desaturase 

activity is from plants. - 

3. The plant of claim 1 in which the linoleic acid desaturase 
activity is from fimgi, algae or bacteria. 

4. The plant of claim 1 in which the structural coding 
20 • sequence of (ii) is taken from SEQ. ID NO:l. 

5. The plant of claim 1 in which the structural coding 
sequence of (ii) is taken from SEQ. ID N0:9. 

6. The plant of claim 1 in which the structural coding 
sequence of (ii) is taken from SEQ. ID NOrll. 

25 7. The plant of claim 1 in which the promoter of (i) is an 

endogenous plant linoleic acid desaturase promoter. 

8. A genetically transformed plant which has a reduced 
linolenic acid content, comprising a recombinant, double-stranded DNA 
molecule comprising 
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(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said proihoter 
operably linked to; 

(ii) a DNA sequence that causes the production of an 
5 RNA sequence that is in antisense orientation to at least 

a portion of a gene that encodes a linoleic acid desaturase 
activity in said plant; and 

(iii) a 3' non-translated region that functions in plant 
cells to promote polyadenylation to the 3* end of said RNA 

10 sequence. 

9- The plant of claim 8 in which the Hnoleic acid desaturase 
enzyme is from plants. 

10. The plant of claim 8 in which the hnoleic acid desaturase 
enzyme is from fungi, algae or bacteria. 

11- The plant of claim 8 in which the structural coding 
sequence of (ii) is taken from SEQ. ID NO:l. 

12. The plant of claim 8 in which the structural coding 
sequence of (ii) is taken from SEQ. ID NO:9. 

13. The plant of claim 8 in which the structural coding 
20 sequence of (ii) is taken from SEQ. 8 ID NO:ll. 

14. The plant of claim 8 in which the promoter of (i) is an 
endogenous plant linoleic add desaturase promoter. 

15. A genetically transformed plant which has an improved 
resistance to low temperatm-es comprising a recombinant, double-stranded 

25 DNA molecule comprising 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably Unked to; 
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(ii) a structural coding sequence that causes the 
production of an RNA sequence that encodes a liholeic 
add desaturase activity; and 

(iii) a 3* non-translated region that functions in plant 
cells to promote pblyadenylation to the 3' end of said RNA 
sequence. 

16. A genetically transformed plant which has an elevated 
ahility to respond to pathogens, comprising a recombinant, double-stranded 
DNA molecule comprising 

10 (i) a promoter that functions in plant cells to cause 

.the production of an RNA sequence, said promoter 
. operably li^ 
. (ii) a structural coding sequence that causes the 
production of an RNA sequence that encodes a linoleic 
15 „ - add desaturase activity; and : . - 

(iii) a 3' non-translated region that functions in plant 
cells to promote polyadenylation to the 3' end of said RNA 
sequence. 

17. A seed produced from genetically transformed plant where 
20 said seed has an linolenic acid content suitable for use as a source of 

linolenic add, said plant comprising a recombinant, double-stranded DNA 
molecule comprising 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 

25 operably linked to; 

(ii) a structural coding sequence that causes the 
production of an RNA sequence that encodes a linoleic 
add desaturase activity; and 
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(iii) a 3' non-translated region that functions in plant 
cells to promote polyadenylation to the 3* end of said RNA 
sequence. 

18. The seed of claim 17 where said plant is selected from the 
5 group consisting of soybean and rapeseed. 

19. A genetically transformed plant which has a linolenic acid 
content of less than about 3%, said plant comprising a recombinant, 
double-stranded DNA molecule comprising 

(i) a promoter that functions in plant cells to cause 
10 the production of an RNA sequence, said promoter 

operably linked to; 

(ii) a DNA sequence that causes the production of an 
KNA sequence that is in antisense orientation to at least 
a portion of a gene that encodes a linoleic add desaturase 

15 activity in said plant; and 

(iii) a 3* non-translated region that functions in plant 
cells to promote polyadenylation to the 3* end of said RNA 
sequence. 

20. A genetically transformed plant which has an increased 
20 oleic acid content, comprising a recombinant, double-stranded DNA 
molecule comprising 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably linked to; 

25 (ii) a DNA sequence that causes the production of an 

RNA sequence that is in antisense orientation to at least 
a portion of a gene that encodes a oleic acid desaturase 
activity in said plant; and 
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(iii) a 3* non^translated region that functions in plant 
cells to promote polyadenylation to the 3' end of said RNA 
sequence. 

21. A genetically transformed plant which has an increased 
5 oleic acid content, comprising a recombinant, double-stranded DNA 

molecule comprising 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably linked to; 

10 (ii) a DNA sequence that causes the production of an 

BNA sequence that, is in antisense orientation to at least 
a portion of a gene, that encodes a Unoleic acid desatiu-ase - 
- activity in said plant; and 
— - - ; (iii) a 3*-non*translated region that functions in plant 
15 _ - cells to promote polyadenylation to the 3* end of said- RNA 

sequence. - - 

22. A method of producing a genetically transformed plant 
which has an elevated linolenic add content, comprising 

. (a) inserting into the genome of a plant cell a 

20 recombinant, double-stranded DNA molecule comprising: 

(i) a promoter that functions in plant cells to 
cause the production of an RNA sequence, said 
promoter operably Unked to; 

(ii) a structural coding sequence that causes 
25 the production of an RNA sequence that encodes 

a linoleic acid desaturase activity; and 

(iii) a 3' non-translated region that functions in 
plant cells to promote polyadenylation to the 3' 
end of said RNA sequence; 
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(b) obtaining transformed plant cells; and 

(c) regenerating from the transformed plant cells 
genetically transformed plants which have an elevated 
Hnolenic add content. 

5 23. The method of claim 22 in which the Hnoleic acid 

desaturase enzyme is from plants. 

24. The method of claim 22 in which the linoleic acid 
desattirase enzyme is from fimgi, algae or bacteria. 

25. The method of claim 22 in which the structxiral coding 
10 sequence of (ii) is taken from SEQ. ID NO:l. 

26. The method of claim 22 in which the structural coding 
sequence of (ii) is taken from SEQ. ID NO:9. 

27. The method of claim 22 in which the structural coding 
sequence of (ii) is taken fit)m SEQ. ID NO:ll. 

^® 28. The plant of dahn 22 in which the promoter of (i) is an 

radogenous plant Unoleic add desaturase promoter. 

29. A method of producing a genetically transformed plant 
which has a reduced hnolenic add content, comprising 

(a) inserting into the genome of a plant cell a 
recombinant, double-stranded DNA molecule comprising: 

(i) a promoter that functions in plant cells to 
cause the production of an RNA sequence, said 
promoter operably linked to; 

(ii) a DNA sequence that causes the 
2^ production of an RNA sequence that is in 

antisense orientation to at least a portion of a 
gene that encodes a linoleic acid desaturase 
activity in said plant; and 
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(iii) a 3' non-translated region that functions in 
plant cells to promote polyadenylation to the 3* 
end of said RNA sequence 
(h) obtaining transformed plant cells; and 
5 (c) regenerating from the transformed plant cells 

genetically transformed plants which have a reduced 
linolenic acid content. 
30. The method of claim 29 in which the linoleic acid 
desaturase enzyme is firom plants. 
10 31. The method of claim 29 in which the linoleic acid 

desaturase enzyme is from fungi, algae or bacteria. 

32. The method of claim 29 in which the structural coding 
sequence" of (ii) is taken from SEQ. ID NO:l. 

33. The method of claim 29 in which the structural coding 
15 sequence of (ii) is taken from SEQ. ID NO:9^^ 

34. The method of claim 29 in which the structural coding 
sequence of (ii) is taken from SEQ. ID NO:ll. 

35. The plant of claim 29 in which the promoter of (i) is an 
endogenous plant linoleic add desaturase promoter. 

20 36. A method of producing a genetically transformed plant 

which has an increased oleic acid content, comprising 

(a) inserting into the genome of a plant cell a 
recombinant, double-stranded DNA molecule comprising: 

(i) a promoter that functions in plant cells to 
25 cause the production of an RNA sequence, said 

promoter operably linked to; 

(ii) a DNA sequence that causes the 
projduction of an RNA sequence that is in 
antisense orientation to at least a portion of a 
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gene that encodes a linoleic acid desaturase 
activity in said plant; and 

(iii) a 3' non-translated region that functions in 
plant cells to promote polyadenylation to the 3* 
5 end of said RNA sequence 

(b) obtaining transformed plant cells; and 

(c) regenerating from the transformed plant cells 
genetically transformed plants which have an increased 
oleic add content. 

10 37. A recombinant, double-stranded DNA molecule 

comprising in sequence: 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably linked to; 

1^ (ii) a structural coding sequence that causes the 

production of an RNA sequence that encodes a linoleic 
acid desaturase activity; and 

(iii) a 3' non-translated region that functions in plant 
cells to promote polyadenylation to the 3* end of said RNA 
20 sequence. 

38. A recombinant, double-stranded DNA molecule 
comprising in sequence: 

(i) a promoter that functions in plant cells to cause 
the production of an RNA sequence, said promoter 

25 operably linked to; 

(ii) a DNA sequence that causes the production of an 
RNA sequence that is in antisense orientation to at least 
a portion of a gene that encodes a linoleic add desaturase 
activity in said plant; and 
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(iii) a 3' non-translated region that functions in plant 
ceUs to promote polyadenylation to the 3* end of said UNA 
sequence. 

39. A plant cell comprising a recombinant, double- 
5 stranded DNA molecule comprising in sequence: 

(i) a promoter that fimctions in plant cells to cause 
the production of an RNA sequence, said promoter 
operably linked to; 

(ii) a DNA sequence that causes the production of an 
10 RNA sequence that is in antisense orientation to at least 

a portion of a gene that encodeis a linoleic acid desaturase 
activity in said plant; and 

(iii) a 3' non-translated region that functions in plant 
- ~ ceUs to promote polyadenylation to the 3' end 

15 ' - ^ sequence. 

40. A method of producing a genetically transformed plant 
which has an increased oleic acid content, comprising 

(a) inserting into the genome of a plant cell a 
recombinant, double-stranded DNA molecule comprising: 
20 (i) a promoter that functions in plant cells to 

caiise the production of ah RNA sequence, said 

promoter operably linked to; 

(ii) a DNA sequencei that causes the 
production of an RNA sequence that is in 
25 antisense orientation to at least a portion of a 

gene that encodes a oleic acid desaturase activity 
in said plant; and 
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(iii) a 3' non-translated region that functions in 
plant cells to promote polyadenylation to the 3* 
end of S€ud RNA sequence 

(b) obtaining transformed plant cells; and 

(c) regenerating from the transformed plant cells 
genetically transformed plants which have an increased 
oleic add content. 
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AATCCATCAA ACCTTTATTC ACCACATTTC ACTGAAAGGC CACACATCTA GAGAGAGAAA 60 

CTTCGTCCAA ATCTCTCTCT CCAGCG ATG GTT GTT GCT ATG GAC GAG CGC AGC 113 

Mel Vol Vol Alo Met Asp Gin Arg Ser 
1 5 

AAT GTT AAC GGA GAT TCC GGT GCC CGG AAG GAA GAA GGG TTT GAT CCA . 161 
Asn Vol Asn Gly Asp Ser Gly Ala Arg Lys Glu Glu Gly Phe Asp Pro 
10 15 20 25 

AGC GCA CAA CCA CCG TTT AAG ATC GGA GAT ATA AGG GCG GCG ATT CCT 209 
Ser Alo Gin Pro Pro Phe Lys He Gly Asp He Arg Alo Alo lie Pro 
30 35 40 

AAG CAT TGC TGG GTG AAG ACT CCT TTG AGA TCT ATG AGC TAC GTC ACC 257 
Lys His Cys Trp Vol Lys Ser Pro Leu Arg Ser Met Ser Tyr Vol Thr 
45 50 55 

AGA GAC ATT TTC GCC GTC GCG GCT CTG GCC ATG GCC GCC GTG TAT TTT 305 
Arg Asp He Phe Alo Vol Alo Alo Leu Alo Met Alo Alo Vol Tyr Phe 

_60- - 65 ... 70 . 

GAT AGC TGG TTC CTC TGG CCA CTC TAC TGG GTT GCC CAA GGA ACC CTT 353 
Asp Ser Trp Phe Leu Trp Pro Leu Tyr Trp Vol Alo Gin Gly Thr Leu 
75 . 80 85 

TTC TGG GCC ATC TTC GTT CTT GGC CAC GAC TGT GGA CAT GGG AGT TTC 401 
Phe Trp Alo He Phe Vol Leu Gly His Asp Cys Gly His Gly Ser Phe 
90 95 100 105 

TCA GAC ATT CCT CTG CTG AAC AGT GTG GTT GGT CAC ATT CTT CAT TCA 449 
Ser Asp He Pro Leu Leu Asn Ser Vol Vol Gly His He Leu His Ser 
110 115 120 

TTC ATC CTC GTT CCT TAC CAT GGT TGG AGA ATA AGC CAT CGG ACA CAC 497 
Phe He Leu Vol Pro Tyr His Gly Trp Arg He Ser His Arg Thr His 
125 130 135 

CAC CAG AAC CAT GGC CAT GTT GAA AAC GAC GAG TCT TGG GTT CCG TTG 545 
His Gin Asn His Gly His Vol Glu Asn Asp Glu Ser Trp Vol Pro Leu 
140 145 150 

FIG.3a 
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CCA GM AAG TTG TAC AAG AAC TTG CCC CAT AGT ACT CGG ATG CTC AGA 593 
Pro Glu Lys Leu Tyr Lys Asn Leu Pro His Ser Thr Arg Met Leu Arg 
155 160 165 

TAC ACT GTC CCT CTG CCC ATG CTC GCT TAC CCG ATC TAT CTG TGG TAC 641 
Tyr Thr Vol Pro Leu Pro Met Leu Alo Tyr Pro Me Tyr Leu Trp Tyr 
170 175 180 185 

AGA AGT CCT GGA AAA GAA GGG TCA CAT TTT AAC CCA TAC AGT AGT TTA 689 
Arg Ser Pro Gly Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu 
190 195 200 

TTT GCT CCA AGC GAG AGG AAG CTT ATT GCA ACT TCA ACT ACT TGC TGG 737 
Phe Alo Pro Ser Glu Arg Lys Leu He Alo Thr Ser Thr Thr Cys Trp 
205 210 215 

TCC ATA ATG TTG GCC ACT CTT GTT TAT CTA TCG TTC CTC GTT GAT CCA 785 
Ser He Met Leu Alo Thr Leu Vol Tyr Leu Ser Phe Leu Vol Asp Pro 
220 225 230 

GTC ACA GTT CTC AAA GTC TAT GGC GTT CCT TAC ATT ATC TTT GTG ATG 833 
Vol Thr Vol Leu Lys Vol Tyr Gly Vol Pro Tyr lie He Phe Vol Mel 
235 240 245 

TGG TTG GAC GCT GTC ACG TAC TTG CAT CAT CAT GGT CAC GAT GAG AAG 881 
Trp Leu Asp Alo Vol Thr Tyr Leu His His His Gly His Asp Glu Lys 
250 255 260 265 

TTG CCT TGG TAC AGA GGC AAG GAA TGG AGT TAT TTA CGT GGA GGA TTA 929 
Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu 
270 275 280 

ACA ACT ATT GAT AGA GAT TAC GGA ATC TTC AAC AAC ATC CAT CAC GAC 977 
Thr Thr He Asp Arg Asp Tyr Gly He Phe Asn Asn He His His Asp 
285 290 295 

ATT GGA ACT CAC GTG ATC CAT CAT CTT TTC CCA CAA ATC CCT CAC TAT 1025 
He Gly Thr His Vol He His His Leu Phe Pro Gin He Pro His Tyr 
300 305 310 
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CAC TTG GTC GAT GCC ACG AGA GCA GOT AAA CAT GIG TTA GGA AGA TAG 1073 
His Leu Vol Asp Alo Thr Arg Ala Alo Lys His Vol Leu Giy Arg Tyr 
315 320 325 

TAG AGA GAG CCG AAG ACG TCA GGA GCA ATA CCG ATT CAC TTG GTG GAG 1121 
Tyr Arg Giu Pro Lys Thr Ser Gly Alo lie Pro He His Leu Vol Giu 
330 335 340 345 

AGT TTG GTC GCA AGT ATT AAA AAA GAT CAT TAG GTC ACT GAC ACT GGT 1169 
Ser Leu Vol Alo Ser lie Lys Lys Asp His Tyr Vol Ser Asp Thr Gly 
350 355 360 

GAT ATT GTC TTG TAG GAG AGA GAT CCA GAT GTC TAG GTT TAT GCT TCT 1217 
Asp lie Vol Phe Tyr Giu Thr Asp Pro Asp Leu Tyr Vol Tyr Alo Ser 
365 370 375 

GAC AAA TCT AAA ATC AAT TAACTTTTCT TCCTAGCTCT ATTAGGAATA 1265 
Asp Lys Ser Lys I le Asn 

380 - 

AACACTGCTT' CTCTTTTACT. TAJTTGJTTC TGCTTTAAGT TTAAAATGTA CTCGTGAAAC 1325 
CTTTTTTTTA TTAATGTATT TACGTTAC 1353 



FIG.3C 
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Met Vol Vol Alo Met Asp Gin Arg Ser Asn Vol Asn Gly Asp Ser Gly 
15 10 15 

Alo Arg Lys Glu Glu Gly Phe Asp Pro Ser Aio Gin Pro Pro Phe Lys 
20 25 30 

He Gly Asp lie Arg Alo Aio He Pro Lys His Cys Trp Vol Lys Ser 
35 40 45 

Pro Leu Arg Ser Met Ser Tyr Vol Thr Arg Asp He Phe Alo Vol Alo 
50 55 60 

Alo Leu Alo Met Alo Alo Vol Tyr Phe Asp Ser Trp Phe Leu Trp Pro 
65 70 75 80 

Leu Tyr Trp Vol Alo Gin Gly Thr Leu Phe Trp Alo He Phe Vol Leu 
85 90 95 

Gly His Asp Cys Gly His Gly Ser Phe Ser Asp He Pro Leu Leu Asn 
100 105 110 

Ser Vol Vol Gly His He Leu His Ser Phe He Leu Vol Pro Tyr His 
115 120 125 

Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly His Vol 
130 135 140 

Glu Asn Asp Glu Ser Trp Vol Pro Leu Pro Glu Lys Leu Tyr Lys Asn 
145 150 155 160 

Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Vol Pro Leu Pro Mel 
165 170 175 

Leu Alo Tyr Pro He Tyr Leu Trp Tyr Arg Ser Pro Gly Lys Glu Gly 
180 185 190 

Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Alo Pro Ser Glu Arg Lys 
195 200 205 

Leu He Alo Thr Ser Thr Thr Cys Trp Ser He Mel Leu Alo Thr Leu 
210 215 220 

Vol Tyr Leu Ser Phe Leu Vol Asp Pro Vol Thr Vol Leu Lys Vol Tyr 
225 230 235 240 

Gly Vol Pro Tyr He He Phe Vol Mel Trp Leu Asp Alo Vol Thr Tyr 
245 250 255 
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Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr Arg Gly Lys 
260 265 270 

Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr lie Asp Arg Asp Tyr 
275 280 285 

Gly He Phe Asn Asn He His His Asp He Gly Thr His Vol He His 
290 295 300 

His Leu Phe Pro Gin He Pro His Tyr His Leu Vol Asp Alo Thr Arg 
305 310 315 320 

Alo Alo Lys His Vol Leu Gly Arg Tyr Tyr Arg Glu Pro Lys Thx Ser 
325 330 335 

Gly Alo He Pro He His Leu Vol Glu Ser Leu Vol Alo Ser He Lys 
340 345 350 . 

Lys Asp His Tyr Vol Ser Asp Thr Gly Asp He Vol Phe Tyr Glu Thr 
355 360 365 

Asp Pro Asp Leu Tyr Vol Tyr Alo Ser Asp Lys Ser Lys He Asn 
370 375 380 
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RECTIFIED SHEET (RULE 91) 
ISA/EP 



wo 94/18337 



PCTAJS94/01321 



8/25 



10 20 30 40 50 60 

BND3 .AMI RSNVNGDSGARKEEGFOPSAQPPFK IGD 1 RAAI PKHCWVKSPLRSMSYVTRD IFAVAALA 



OESA.AMI MTATIPPLTPTVTPSNPDRPI ADLKLQDI IKTLPKECFEKKASKAWASVL ITLGAIAVGY 

10 20 30 40 50 60 

70 80 90 100 110 120 

BND3 . AM I MAAVYFDSWFLWPL YWVAQGTLFWA I FVLGHDCGHGSFSD I PLLNSWGHI LHSF I L VPY 
.. .:. .X. :. :. :: . . ::.:::::: . . .. :. 

OESA.AMI LCI lYL-PWYCLPITWIWTGTALTGAFWGHDCGHRSFAKKRWVNDLVGHIAFAPLIYPF 

70 80 90 100 110 

130 140 150 160 170 180 

BND3 . AM I HGWR I SHRTHHQNHGHVENDESWVPLPEKL YKNLPHSTRMLRYTVPLPH-LAYP I YLWYR 

• ••••• , 

OESA.AMI HSWRLLHDHHHLHTNKiEVDNAWDFWSVEAFQASPAIVRLFYRAiw^^^ 
120 130 140 150 160 170 

190 200 210 220 230 240 

BND3 .AMI SPGKEGSHFNPYSSLFAPSERKL I ATSTTCWS I ML ATL VYLSFL VDP-V-TVLKVYGVPY 

•• •••• • , •V/" • 

DESA . AM I SLMHFK— LSNFAQRDRNKVKLS I AV-VFLF AA I AFPAL 1 1 TTGVWGFVKFWLMPW 

180 190 200 210 220 230 

250 260 270 280 290 300 

BNDS.AMI I IFNfli^WLDAVTYLHHHGHDEKLPWYRGKEVIISYLRGGL-TTIDRDYGIFNNIH-HDIGTHV 

DESA . AMI L VYHFIAMSTFT I VHHT I PE I RF— RPMOlisAAEAOLNGT WCDYPRWVE VLCHD i NVH i 

240 250 260 270 280 

310 320 330 340 350 360 

BND3 .AMI I HHLFPOI PHYHL VDATRAAKHVLGRYYREPKTSGAI PIHLVESLVAS IKKDHYVSDTGD 

OESA.AMI PHHLSVAIPSYNLRLAHGSLKENVIGPFLYERTFNWQLMQQISGQCHLYDPEHGYRTFGSL 
290 300 310 320 330 340 

BND3.AMI IVF 

OESA.AMI KKV 
350 
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GGAAAACACA AGTTTCTCTC ACACACATTA TCTCTTTCTC TATTACCACC ACTCATTCAT 60 

AACAGAAACC CACCAAAAAA TAAAAAGAGA GACTTTTCAC TCTGGGGAGA GAGCTCAAGT 120 

TCTA ATG GCG AAC TTG GTC TTA TCA GAA TGT GGT ATA CGA CCT CTC CCC 169 
Mel Alo Asn Leu Vol Leu Ser Glu Cys Gly He Arg Pro Leu Pro 
1 5 10 15 

AGA ATG TAG ACA ACA CCC AGA TCC AAT TTG CTC TCC AAC AAC AAC AAA 217 
Arg He Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys 
20 25 30 

TTC AGA CCA TCA CTT TOT TCT TCT TGT TAC AAA ACA TCA TCA TCT CCT 265 
Phe Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro 
35 40 45 

CTG TCT TTT GGT CTG AAT TCA CGA GAT GGG TTC ACQ AGG AAT TGG GCG 313 
Leu Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Alo 
50 55 60 

TTG AAT GTG AGC ACA CCA TTA ACG ACA CCA ATA TTT GAG GAG TCT CCA 351 
Leu Asn Vol Ser Thr Pro Leu Thr Thr Pro lie Phe Glu Glu Ser Pro 
65 70 75 

TTG GAG GAA GAT AAT AAA CAG AGA TTC GAT CCA GGT GCG CCT CCT CCG 409 
Leu Glu Glu Asp Asn Lys Gin Arg Phe Asp Pro Gly Alo Pro Pro Pro 
80 85 90 95 

TTC AAT TTA GCT GAT ATT AGA GCA GCT ATA CCT AAG CAT TGT TGG GTT 457 
Phe Asn Leu Alo Asp lie Arg Alo Alo He Pro Lys His Cys Trp Vol 
100 105 110 

AAG AAT CCA TGG AAG TCT TTG AGT TAT GTC GTC AGA GAC GTC GCT ATG 505 
Lys Asn Pro Trp Lys Ser Leu Ser Tyr Vol Vol Arg Asp Vol Alo lie 
115 120 125 

GTC TTT GCA TTG GCT GCT GGA GCT GCT TAC CTC AAC AAT TGG ATT GTT 553 
Vol Phe Alo Leu Alo Alo Gly Alo Alo Tyr Leu Asn Asn Trp He Vol 
130 135 140 
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TGG CCT CTC TAT TGG CTC GCT CM GGA ACC ATG TTT TGG GOT CTC TTT 601 
Trp Pro Leu Tyr Trp Leu Alo Gin Gly Thr Met Phe Trp Alo Leu Phe 
145 150 155 

GTT CTT GGT CAT GAC TGT GGA CAT GGT AGT TTC TCA AAT GAT CCG AAG 649 
Vol Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys 
160 165 170 175 

TTG AAC AGT GTG GTC GGT CAT CTT CTT CAT TCC TCA ATT CTG GTC CCA 697 
Leu Asn Ser Vol Vol Gly His Leu Leu His Ser Ser He Leu Vol Pro 
180 185 190 

TAG CAT GGC TGG AGA ATT AGT CAC AGA ACT CAC CAC CAG AAC CAT GGA 745 
Tyr His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly 
195 200 205 

CAT GTT GAG AAT GAC GAA TCT TGG CAT CCT ATG TCT GAG AAA ATC TAG . 793 
His Vol Glu Asn Asp Glu Ser Trp His Pro Mel Ser Glu Lys He Tyr 
210 215 . 220 

AAT ACT TTG GAC AAG CCG ACT AGA TTC TTT AGA TTT ACA CTG CCT CTC 841 
Asn Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu 
225 230 235 

GTG ATG CTT GCA TAC CCT TTC TAG TTG TGG GCT CGA AGT CCG GGG AAA 889 
Vol Met Leu Alo Tyr Pro Phe Tyr Leu Trp Alo Arg Ser Pro Gly Lys 
240 245 250 255 

AAG GGT TCT CAT TAC CAT CCA GAC AGT GAC TTG TTC CTC CCT AAA GAG 937 
Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu 
260 265 270 

AGA AAG GAT GTC CTC ACT TCT ACT GCT TGT TGG ACT GCA ATG GCT GCT 985 
Arg Lys Asp Vol Leu Thr Ser Thr Alo Cys Trp Thr Alo Met Alo Alo 
275 280 . 285 

CTG CTT GTT TGT CTC AAC TTC ACA ATC GGT CCA ATT CAA ATG CTC AAA 1033 
Leu Leu Vol Cys Leu Asn Phe Thr He Gly Pro He Gin Met Leu Lys 
290 295 300 
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CTT TAT GGA ATT CCT TAC TGG ATA AAT GTA ATG TGG TTG GAC TTT GTG 1081 
Leu Tyr Gly He Pro Tyr Trp He Asn Vol Met Trp Leu Asp Phe Vol 
305 310 315 

ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG CTT CCT TGG TAC CGT 1 129 
Thr Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arc 
320 325 330 335 

GGC AAG GAG TGG AGT TAC CTG AGA GGA GGA CTT ACA ACA TTG GAT CGT 1177 
Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg 
340 345 350 

GAC TAC GGA TTG ATC AAT AAC ATC CAT CAT GAT ATT GGA ACT CAT GTG 1225 
Asp Tyr Gly Leu He Asn Asn He His His Asp He Gly Thr His Vol 
355 360 365 

ATA CAT CAT CTT TTC CCG CAG ATC CCA CAT TAT CAT CTA GTA GAA GCA 1273 
He His His Leu Phe Pro Gin He Pro His Tyr His Leu Vol Glu Alo 
370 375 380 

ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT TAC AGG GAG CCT GAT 1321 
Thr Glu Alo Alo Lys Pro Vol Leu Gly Lys Tyr Tyr Arg Glu Pro Asp 
385 390 395 

AAG TCT GGA CCG TTG CCA TTA CAT TTA CTG GAA ATT CTA GCG AAA AGT 1369 
Lys Ser Gly Pro Leu Pro Leu His Leu Leu Glu He Leu Alo Lys Ser 
400 405 410 415 

ATA AAA GAA GAT CAT TAC GTG AGC GAC GAA GGA GAA GTT GTA TAC TAT 1417 
He Lys Glu Asp His Tyr Vol Ser Asp Glu Gly Glu Vol Vol Tyr Tyr 
420 425 430 

AAA GCA GAT CCA AAT CTC TAT GGA GAG GTC AAA GTA AGA GCA GAT TGAAATGAAG 1472 
Lys Alo Asp Pro Asn Leu Tyr Gly Glu Vol Lys Vol Arg Alo Asp 
435 440 445 

CAGGCTTGAG ATTGAAGTTT TTTCTATTTC AGACCAGCTG ATTTTTTGCT TACTGTATCA 1532 

ATTTATTGTG TCACCCACCA GAGAGTTAGT ATCTCTGAAT ACGATCGATC AGATGGAAAC 1592 

AACAAATTTG TTTGCGATAC TGAAGCTATA TATACCATAA AAAAAAAAAA AAA 1645 
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Mel Alo Asn Leu Vol Leu Ser Glu Cys Gly He Arg Pro Leu Pro Arg 



He Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys Phe 



Arg Pro Ser Leu Ser Ser. Ser Ser Tyr Lys Thr Ser Ser Ser Pro Leu 
35 40 .45 

Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Alo Leu 
50 55 60 

Asn Vol Ser Thr Pro Leu Thr Thr Pro He Phe Glu Glu Ser Pro Leu 
65 70 75 80 

Glu Glu Asp Asn Lys Gin Arg Phe Asp Pro Gly Alo Pro Pro Pro Phe 
85 " 90 95 

Asn Leu Alo Asp He Arg Alo Alo He Pro Lys His Cys Trp Vol Lys 
100 .105 ^110 

Asn Prb'Trp Lys Ser Leu Ser Tyr Vdl Vol Arg Asp Vol Alo He Vol 
115 120 125 

Phe Alo Leu Alo Alo Gly Alo Alo Tyr Leu Asn Asn Trp He Vol Trp 
130 135 ; 140 

Pro Leu Tyr Trp Leu Alo Gin Gly Thr Met Phe Trp Alo Leu Phe Vol 
145 150 155 160 

Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys Leu 
165 170 175 

Asn Ser Vol Vol Gly His Leu Leu His Ser Ser He Leu Vol Pro Tyr 
180 185 190 

His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly His 
195 200 205 

Vol Glu Asn Asp Glu Ser Trp His Pro Mel Ser Glu Lys He Tyr Asn 
210 215 220 
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Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu Vol 
225 230 235 240 

Met Leu Alo Tyr Pro Phe Tyr Leu Trp Alo Arg Ser Pro Gly Lys Lys 
245 250 255 

Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu Arg 
260 265 270 

Lys Asp Vol Leu Thr Ser Thr Alo Cys Trp Thr Alo Met Alo Alo Leu 
275 280 285 

Leu Vol Cys Leu Asn Phe Thr lie Gly Pro He Gin Met Leu Lys Leu 
290 295 300 

Tyr Gly lie Pro Tyr Trp lie Asn Vol Met Trp Leu Asp Phe Vol Thr 
305 310 315 320 

Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg Gly 
325 330 335 

Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg Asp 
340 345 350 

Tyr Gly Leu lie Asn Asn lie His His Asp He Gly Thr His Vol lie 
355 360 365 

His His Leu Phe Pro Gin lie Pro His Tyr His Leu Vol Glu Alo Thr 
370 375 380 

Glu Alo Alo Lys Pro Vol Leu Gly Lys Tyr Tyr Arg Glu Pro Asp Lys 
385 390 395 400 

Ser Gly Pro Leu Pro Leu His Leu Leu Glu He Leu Alo Lys Ser He 
405 410 415 

Lys Glu Asp His Tyr Vol Ser Asp Glu Gly Glu Vol Vol Tyr Tyr Lys 
420 425 430 

Alo Asp Pro Asn Leu Tyr Gly Glu Vol Lys Vol Arg Alo Asp 
435 440 445 
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AGAGAGTGCA AATAGAACGA CAGAGACTTT TTCCTCTTTT CTTCTTGGGA AGAGGCTCCA 60 

ATG GCG AGC TCG GTT TTA TCA GAA TGT GOT TTT AGA OCT CTC CCC AGA . 108 
Met Alo Ser Ser Vol Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg 
1 5 10 15 

TTC TAG COT AAA GAG ACA ACC TCT TTT GCC TCT AAC CCT AAA CCC ACT 156 
Phe Tyr Pro Lys His Thr Thr Ser Phe Alo Ser Asn Pro Lys Pro Thr 
20 25 30 

TTC AAA TTC AAT CCA CCA CTT AAA CCT CCT TCT TCT CTT CTC AAT TCC 204 
Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser 
35 40 45 

CGA TAT GGA TTC TAG TCT AAA ACC AGG AAC TGG GCA TTG AAT GTG GCA 252 
Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Alo Leu Asn Vol Alo 
50 55 60 

ACA CCT TTA ACA ACT CTT GAG TCT CCA TCC GAG GAA GAC ACG GAG AGA 300 
Thr Pro Leu Thr Thr Leu Gin Ser Pro Ser Glu Glu Asp Thr Glu Arg 
65 70 75 . 80 

TTC GAC CCA GGT GCG CCT CCT CCC TTC AAT TTG GCG GAT ATA AGA GCA .348 
Phe Asp Pro Gly Alo Pro Pro Pro Phe Asn Leu Alo Asp He Arg Ala 
85 90 95 

GCC ATA CCT AAG CAT TGT TGG GTT AAG AAT CCA TGG ATG TCT ATG AGT 396 
Alo lie Pro Lys His Cys Trp Vol Lys Asn Pro Trp Met Ser Met Ser 
100 105 110 

TAT GTT GTC AGA GAT GTT GGT ATC GTC TTT GGA TTG GCT GCT GTT GGT 444 
Tyr Vol Vol Arg Asp Vol Alo He Vol Phe Gly Leu Alo Alo Vol Alo 
115 120 125 

GCT TAC TTC AAC AAT TGG CTT CTC TGG CCT CTC TAG TGG TTC GCT CAA 492 
Alo Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Alo Gin 
130 135 140 

GGA ACC ATG TTC TGG GCT CTC TTT GTC CTT GGG CAT GAC TGG GGA CAT 540 
Gly Thr Met Phe Trp Alo Leu Phe Vol Leu Gly His Asp Cys Gly His 

145 150 155 160 
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GGT AGC TTC TCG AAT GAT CCG AGG CTG AAC AGT GTG GCT GGT CAT CTT 588 
Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Vol Alo Gly His Leu 
165 170 175 

CTT CAT TCC TCA ATT CTG GTC CCT TAC CAT GGC TGG AGG ATT AGC CAC 636 
Leu His Ser Ser He Leu Vol Pro Tyr His Gly Trp Arg lie Ser His 
180 185 190 

AGA ACT CAC CAC GAG AAC CAT GGT CAT GTC GAG AAT GAG GAA TCA TGG 684 
Arg Thr His His Gin Asn His Gly His Vol Glu Asn Asp Glu Ser Trp 
195 200 205 

CAT CCT TTG CCT GAA AGC ATC TAC AAG AAT TTG GAA AAG ACG ACT CAA 732 
His Pro Leu Pro Glu Ser He Tyr Lys Asn Leu Glu Lys Thr Thr Gin 
210 215 220 

ATG TTT AGG TTT ACA CTG CCT TTT CCA ATG CTG GCA TAC CCT TTC TAC 780 
Mel Phe Arg Phe Thr Leu Pro Phe Pro Mel Leu Alo Tyr Pro Phe Tyr 
225 230 235 240 

TTG TGG AAC AGA AGT CCA GGG AAA CAA GGT TCT CAT TAT CAT CCG GAG 828 
Leu'Trp Asn Arg Ser Pro Gly Lys Gin Gly Ser His Tyr His Pro Asp 
245 250 255 

AGT GAG TTG TTT CTT CCA AAA GAG AAG AAA GAT GTT CTG ACA TCA ACT 876 
Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Vol Leu Thr Ser Thr 
260 265 270 

GCC TGT TGG ACT GCA ATG GCT GCT TTG CTT GTT TGT CTG AAC TTT GTC 924 
Alo Cys Trp Thr Alo Mel Alo Alo Leu Leu Vol Cys Leu Asn Phe Vol 
275 280 285 

ATG GGT CCA ATC CAG ATG CTG AAA CTA TAT GGC ATC CCT TAT TGG ATA 972 
Mel Gly Pro He Gin Mel Leu Lys Leu Tyr Gly He Pro Tyr Trp He 
290 295 300 

TTT GTA ATG TGG TTG GAG TTC GTC ACT TAC TTG CAC CAC CAT GGA CAT 1020 
Phe Vol Mel Trp Leu Asp Phe Vol Thr Tyr Leu His His His Gly His 
305 310 315 320 
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GAA GAC AAG CTC CCT TGG TAT CGT GGA AAG GAA TOG AGT TAG CTG AGA 1068 
Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg 
325 330 335 

GGA GGG CTC ACA ACA TTA GAT CGT GAC TAC GGA TGG ATC AAT AAC ATC 1116 
Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp He Asn Asn He 
340 345 350 

GAC CAC GAT ATT GGA ACT CAT GTG ATA CAT CAT CTT TTC CCG CAG ATC 1164 
His His Asp He Gly Thr His Vol He His His Leu Phe Pro Gin He 
355 360 365 

CCA CAT TAT CAT CTA GTA GAA GCA ACA GAA GCA GCT AAA CCA GTA CTA 1212 
Pro His Tyr His Leu Vol Glu Alo Thr Glu Alo Alo Lys Pro Vol Leu 
370 375 380 

GGA AAG TAC TAC AGA GAA CCG AAA AAC TCT GGA CCT CTG CCA CTT CAC 1260 
Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His 
385 . 390 395 400 - 

TTA CTG GGA AGC CTC ATA AAG AGT ATG AAA CAA GAC CAT TTC GTA AGC - 1308 
Leu Leu Gly Ser Leu He. Lys Ser litet Lys Gin Asp His Phe Vol Ser :-. - 
405. 410 415 

GAT ACA GGA GAT GTC GTG TAC TAT GAG GCA GAT CCA AAA CTC AAT GGA 1356 
Asp Thr Gly Asp Vol Vol Tyr Tyr Glu Alo Asp Pro Lys Leu Asn Gly 
420 425 430 

CAA AGA ACA TGAGGACATA CTGCABTGAA CCAGGCAGAC AAGTTACATA 1405 
Gin Arg Thr 
435 

AATTCATCTT GGCCCATTCA TTATGTTCTT TTTGTTTTGG TGTAAAGCCT TTTOGAGATT 1465 
AAAAAAGCAT TAATTTGTAG AAACCTGTGG TAAAACTCTC GATCAAATGA AATAAGATAT 1525 
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Mel Ala Ser Ser Vol Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg 
15 10 15 

Phe Tyr Pro Lys His Thr Thr Ser Phe Aid Ser Asn Pro Lys Pro Thr 
20 25 30 

Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser 
35 40 45 

Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Alo Leu Asn Vol Alo 
50 55 60 

Thr Pro Leu Thr Thr Leu Gin Ser Pro Ser Glu Glu Asp Thr Glu Arg 
65 70 75 80 

Phe Asp Pro Gly Alo Pro Pro Pro Phe Asn Leu Alo Asp He Arg Alo 
85 90 95 

Ale He Pro Lys His Cys Trp Vol Lys Asn Pro Trp Met Ser Met Ser 
100 105 110 

Tyr Vol Vol Arg Asp Vol Alo lie Vol Phe Gly Leu Alo Alo Vol Alo 
115 120 125 

Alo Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Alo Gin 
130 135 140 

Gly Thr Met Phe Trp Alo Leu Phe Vol Leu Gly His Asp Cys Gly His 
145 150 155 160 

Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Vol Alo Gly His Leu 
165 170 175 

Leu His Ser Ser He Leu Vol Pro Tyr His Gly Trp Arg He Ser His 
180 185 190 

Arg Thr His His Gin Asn His Gly His Vol Glu Asn Asp Glu Ser Trp 
195 200 205 

His Pro Leu Pro Glu Ser He Tyr Lys Asn Leu Glu Lys Thr Thr Gin 
210 215 220 
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Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Alo Tyr Pro Phe Tyr 
225 230 235 240 

Leu Trp Asn Arg Ser Pro Gly Lys Gin Gly Ser His Tyr His Pro Asp 
245 250 255 

Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Vol Leu Thr Ser Thr 
260 265 270 

Alo Cys Trp Thr Alo Met Alo Alo Leu Leu Vol Cys Leu Asn Phe Vol 
275 280 285 

Mel Gly Pro He Gin Met Leu Lys Leu Tyr Gly lie Pro Tyr Trp lie 
290 295 300 

Phe Vol Met Trp Leu Asp Phe Vol Thr Tyr Leu His His His Gly His 

305 :. r : 310 . -. 315 . - - 320 

Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg 
- 325 : 330 335 



Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp He Asn Asn lie 
340 345 350 

His His Asp I le Gly Thr His Vol 1 le His His Leu Phe Pro Gin 1 le 
355 360 365 

Pro His Tyr His Leu Vol Glu Alo Thr Glu Alo Alo Lys Pro Vol Leu 
370 375 380 

Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His 
385 390 395 400 

Leu Leu Gly Ser Leu lie Lys Ser Met Lys Gin Asp His Phe Vol Ser 
405 410 415 

Asp Thr Gly Asp Vol Vol Tyr Tyr Glu Alo Asp Pro Lys Leu Asn Gly 
420 425 430 

Gin Arg Thr 
435 



FIG. 13 b "^^^'^'--.DJI^ 



INTERNATIONAL SEARCH REPORT 



late* ttl AppUcatioD No 

PCT/US 94/01321 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 5 C12N15/82 C12N15/53 C12N15/11 C12N5/10 A01H5/00 
CllBl/00 

Accordipg to totemational Patent aaaafication (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 


Minimum documaiation searched {dasnfication system followed by das 


sification ^yihbe 


As) 


IPC 5 C12N AOIH CUB 






PoCTimrntation cearched other than minimum documentauon to the exten 


t that such docu 


nems are included in the fields searched 



Electronic data base consulted durmg the tnteniationai search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Gitatian of document, with indication, where appropriate, of the relevant 



Rdcvuit to claim No. 



SCIENCE 

vol . 258 , 20 November 1992 . LANCASTER, 
PA US 

pages 1353 - 1355 

ARONDEL» v.. ET AL. 'Map-based cloning of 
a gene controlling omega-3 fatty acid 
desaturation in Arabidopsis' 
see the whole document 



37 



1.2.4,8. 

9.11.17, 

18,22, 

23,25, 

29,30, 

32,38 



Fuithcr documatts are listed in the oonluua&on of box C. 



m 



Patent family mcmbos are listed in annex. 



* Speaal categories of cited documents : 

'A' document defiiung the general state of the art which is not 
conadered to be of particular relevance 

*E' earlier document but published on or after the international 
fihngdate 

document which may throw doubts on pnonty claiin(s) or 
which IS ated to establish the publication date of another 
dtaiton or other speaal reason (as specified) 

docimient referring lo an oral disdosure, use, exhibiuon or 
other means 

P* document pUbbdicd pnor to the mtemational filing date but 
later than the pnonty date claimed 



n** later document published alter the intemahonal filing date 
or pnonty date and not m oonflict with the appUcahon tnit 
ated to undentind the principle or theory underlying the 
tnvenhon 

*X* document of particular relevance; the claimed mweaiion 
cannot be considered novel or cannot be consideRd to 
involve an mventive step when the document is taken alooe 

"Y* document of particular relevance; the claimed invenuon 
cannot be considered to involve an mventive step when the 
document is combined with one or more other such docu- 
ments, such combination being <^bvious to a person skilled 
m the an. 

'ft* document monber of the same patent family 



Date of the actual completion of the mtemaaonal search 

1 June 1994 


Date of mailing of the mtemational search report 

J 4 -06- 


Name and mailmg address of the ISA 

European Patent Office, P.B. SSI 8 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. 31-70) 340-2040, Tx. 31 651 cponi. 
Far (-^ 31.70) 340-3016 


Authofued officer 

Maddox, A 



page 1 of 3 



INTERNATIONAL. SEARCH REPORT 


IntQ lAl Application No 

PCT/US 94/01321 


C^Conniwu 


ition) DOCUMENTS CONSIDERED TO BE RELEVANT . 






Catefary * 


Gtuon of with mdioiaon, where appnpnate, of the relevant paaacei 


Relevant to claim No. 


X 

Y 


US. A, 5 057 419 (MARTIN) 15 October 1991 
see column 6, line 40 - column 6, line 66 

see column 9. line ~ column lu. nne 9o 




20,40 

1.2,4,8, 

9 11 17. 

18.22. 

23,25, 

29,30. 

32,38 


Y 


WO. A, 91 13972 (CALGENE) 19 September 1991 
see the whole document 




1,2,4,8. 

9,11,17, 

18,22, 

23.25, 

29.30, 

32.38 


P.X 


JOURNAL OF BIOLOGICAL CHEMISTRY 

vol. 268, no. 32 , 15 November 1993 , 

nil T^\Af\T\t^ kin lie 

BALTIMORE, MD u5 
pages 24099 - 24105 
IBA, K., FT AL. "A gene encoding a 
chloroplast omega-3- fatty add desaturase 
complements alterations in fatty acid 
desaturation and chloroplast copy numbers 
of the fad? mutant of Arabidopsis 
- thai iana' 
see the whole document.. 




1.2.5, 
22,23. 
26,37 


P.X 

- 


PLANT PHYSIOLOGY. 

vol. 103 , October 1993 . ROCKVILLE. MD, 
USA. 

pages 467 - 476 

YADAV» N.5., cT AL. Cloning OT mgner 
plant omega-3- fatty acid desaturases* 
see the whole document 




1,2,17, 
90 n 


P.X 


WO, A, 93 11245 (DU PONT) 10 June 1993 
see the whole document 




1.2.8.9. 
17.22. 

30.37.38 


P.X 


WO, A, 93 06712 (RHONE-POULENC AGROCHIMIE) 
15 April 1993 




1.2.22, 
23,37 


A 


PLANT PHYSIOLOGY. 

vol. 100 . 1992 , ROCKVILLE, MD, USA. 
pages 894 - 901 

POLASHOCK. J.J., ET AL. 'Expression of the 
yeast delta-9 fatty acid desaturase in 
Nicotina tabacum* 
see the whole document 




1.22 



Fom PCTASA/aiO (sonliauaboa of weontf «hMt) (July 1993) 



page 2 of 3 



IfTIERNATIONAI. SEARCH REPORT 



bme. jud Applicalion No 

PCT/US 94/01321 



C^Contuuuoon) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Ottuon of document, with indicabon, where appropnate. of the relevant passages 



Relevant to claim No. 



ANN. REV. PLANT PHYSIOL. PUNT MOL. BIOL, 
vol. 42 , 1991 
pages 467 - 506 

BROWSE, J.. ET AL. 'Glycerolipid 
synthesis: Biochemistry and regulation' 
see the whole document 

UCU SYMP. MOL. CELL BIOL, NEW SER. 
vol. 129 , 1990 
pages 301 - 309 

BROWSE. J., ET AL. 'Strtegies for 
modifying plant lipid composition' 
see page 306 

NL,A,9 002 130 (STICHTING TECHNISCHE 
WETENSCHAPPEN UTRECHT) 16 April 1992 
see the whole document 



1-40 



1.22 



1-40 



Fom PCT/ISA/310 <eDnttnuatiea of CMond thnt) (July 1M9) 



page 3 of 3 



